Y/ 7c? -/<$><! ??L, 

COMMUNICATION THEORY OF QUANTUM SYSTEMS 


HORACE P. H. YUEN 



TECHNICAL REPORT 482 

AUGUST 30, 1971 


MASSACHUSETTS INSTITUTE OF TECHNOLOGY 

RESEARCH LABORATORY OF ELECTRONICS 

CAMBRIDGE, MASSACHUSETTS 02139 


The . Research Laboratory of Electronics is an interdepartmental 
laboratory in which faculty members and graduate students from 
numerous academic departments conduct research. 

The research reported is this document was made possible in 
part by support extended the Massachusetts Institute of Tech- 
nology, Research Laboratory of Electronics, by the JOINT SER- 
VICES ELECTRONICS PROGRAMS (U. S. Army, U. S. Navy, and 
U. S. Air Force) under Contract No. DA 28-043-AMC-02536(E), 
and by the National Aeronautics and Space Administration (Grant 
NGL 22-009-013). 

Requestors having DOD contracts or grants should apply for 
copies of technical reports to the Defense Documentation Center, 
Cameron Station, Alexandria, Virginia 22314; all others should 
apply to the Clearinghouse for Federal Scientific and Technical 
Information, Sills Building, 5285 Port Royal Road, Springfield, 
Virginia 22151. 


THIS DOCUMENT HAS BEEN APPROVED FOR PUBLIC 
RELEASE AND SALE; ITS DISTRIBUTION IS UNLIMITED. 




MASSACHUSETTS INSTITUTE OF TECHNOLOGY 
RESEARCH LABORATORY OF ELECTRONICS 


Technical Report 482 


August 30, 197 1 


COMMUNICATION THEORY OF QUANTUM SYSTEMS 
Horace P. H. Yuen 


Submitted to the Department of Electrical 
Engineering at the Massachusetts Institute 
of Technology in June 1970 in partial ful- 
fillment of the requirements for the degree 
of Doctor of Philosophy. 


(Manuscript received January 1, 1971) 


THIS DOCUMENT HAS BEEN APPROVED FOR PUBLIC 
RELEASE AND SALE; ITS DISTRIBUTION IS UNLIMITED. 




Abstract 


The primary concern in this research is with communication theory problems in- 
corporating quantum effects for optical -frequency applications. Under suitable con- 
, ditions, a unique quantum channel model corresponding to a given classical space -time 
varying linear random channel is established. A procedure is described by which a 
proper density -operator representation applicable to any receiver configuration can be 
constructed directly from the channel output field. Some examples illustrating the 
application of our methods to the development of optical quantum channel representa- 
tions are given. 

Optitnizations of communication system performance under different criteria are 
considered. In particular, certain necessary and sufficient conditions on the optimal 
detector in M-ary quantum signal detection are derived. Some examples are presented. 
Parameter estimation and channel capacity are discussed briefly. 
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Part I. Development of Communication System Models 

A. GENERAL INTRODUCTION AND SUMMARY OF PART I 

The familiar statistical communication theory stemming from the work of Shannon , 1 
2 3 

Kotelnikov, and Wiener is a general mathematical theory. For its application appro- 
priate mathematical models need to be established for the physical sources and channels, 
For frequencies around and below microwave the electromagnetic fields can be accu- 
rately described by classical physics, and the statistical theory can be applied directly 
to channels for such fields. An example is furnished by the study of microwave fading 
dispersive communication systems. 

At higher frequencies, however, quantum effects become important. Even an other- 
wise deterministic signal at the output of the channel has to be replaced by a statistical 
quantum description. Furthermore, a choice among various possible, but mutually 
exclusive, measurements on these signals has to be made to extract the relevant infor- 
mation. Therefore we have to develop communication system models in a proper 
quantum-mechanical manner, and to consider the measurement optimization problem 

that is superimposed upon the existing theories. Since measurement in quantum 

5-9 ' 

theory is of a totally different nature from classical measurement, special physical 

consideration has to be given to the receiver implementation problem. The necessity 
of investigating this class of quantum communication problems springs from recent 
advances in quantum electronics, which indicate that efficient communications at infra- 
red and optical frequencies will be feasible in the future. We shall refer to the usual 
communication theory 10-14 for which quantum effects are neglected as classical com - 
munication theory, in .contradistinction to quantum communication theory . 

The necessity of considering quantized electromagnetic fields for communication 

15 

applications was suggested twenty years ago by Gabor in connection with the finiteness 

of the channel capacity. It was then soon recognized 1 ^ 1 ^ that when the signal frequency 

is high relative to the system temperature, proper quantum treatment has to be given 

in communication analysis. Since the advent of optical masers there has been more 

extensive consideration of quantum communication, beginning with the work of 

Gordon. ’ In the early studies ” attention was concentrated primarily on the 

performance of the system, in particular on the channel capacity, incorporating specific 

receivers of measurement observables. Some generalized measurement schemes have 

37-41 

also been considered. Development of general theories closer in spirit to clas- 

42-44 

sical communication theory was pioneered by Helstrom, who has formulated and 

solved some basic problems in the quantum statistical theory of signal detection and 

estimation. Further significant works of a similar nature are due to Jane W. S. Liu 4 ' 1 ’ 4 ^ 
47 48 49 

and to Personick. ’ A comprehensive review of these studies on optimal quantum 
receivers is available. There are still many unsolved fundamental problems in a general 
quantum communication theory, however, some of which will be treated in this report. 
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As for system modeling, we want to find a specific density operator channel repre- 
sentation for a given communication situation. This problem has not been considered 
in general before. The models that have been used pertain to representations of the 
received fields and are obtained by detailed specific analysis in simple cases, or by 

judicious choice from some standard density operator forms in more complicated situ- 
45 50 

ations. ’ The quantum channel representation for a given classical linear filter 
channel, for example, has not been given. Such relations between the input and output 
signals are needed for formulating problems such as signal design for a given channel- 
receiver structure. A prime objective of our study is to establish a general. procedure 
for setting up such quantum channel representations, with emphasis upon unique or 
canonical quantum correspondents of given classes of classical channels. 

Our work is divided into two relatively independent parts on system modeling and 
system optimization. In Part I we establish a procedure by which various density oper- 
ator channel representations can be written from a given classical channel specification. 
This is achieved by a quantum field description of the communication system parallel 

to the classical description. The problems of transmitter and receiver modeling are 

51 

also considered. Some applications to optical frequency channels are given. In 

Part II we have derived some necessary and sufficient conditions for general optimal 

52 53 

receiver specification in M-ary quantum detection. ’ Estimation and channel- 
capacity problems are also briefly treated. Some examples illustrating the major 
results are given. The study reported here provides the most general existing frame- 
work for quantum communication analysis. 

1 . 1 Summary of Part I 

In Part I we are concerned with the task of establishing quantum- mechanical com- 
munication system models for various communication situations. We shall develop 
quantum channel representations for different transmission media, signaling schemes, 
and receiver classes. These representations are clearly prerequisites of a detailed 
system analysis. In particular, we want to find, under reasonable assumptions, a 
canonical quantum channel model corresponding to a given classical specification. A 
general procedure that yields the quantum channel characterization for a broad class 
of systems through the classical characterization will be described. We shall give a 
preliminary discussion on the purpose and nature of our theory. 

1. 2 Relation to Previous Work 

The development of quantum communication system models has not been considered 
in general before. In previous work on quantum communication the received fields have 
been considered directly. The receiver is usually taken to be a lossless cavity 

that captures the incoming field during the signaling interval. The desired quantum 
measurement can then be made on the cavity field, which is represented in a modal 
expansion in terms of orthonormal spatial-temporal mode functions. While such a 


2 


model of the received field can be useful, it is not sufficient for describing general com- 
munication systems. 

In the first place, a density operator representation for the cavity field modes may 
not describe all possible receivers. Second, the connection of the cavity field with the 
channel output field is unclear. The most important point, however, is that without 
knowledge of the channel output field commutator there is no way to accurately determine 
the cavity field density operator representation in general. Such a commutator, of 
course, is closely related to the channel properties. Thus a more detailed consideration 
is required to develop the receiver input density operator representations for the entire 
communication system. Furthermore, general relationships between the input signals 
and the output fields are needed for formulating problems such as signal design. 

Our theory gives a general quantum description of communication systems including 
the channel, the transmitter, and the receiver. We shall develop a procedure by which 
a proper density operator representation can be constructed from the channel output 
field directly for any receiver configuration. The communication system will be 
described quantum-mechanically in away that parallels the usual approach in classical 
communication theory. The complete quantum description of the channel output field 
will be given in terms of the signal and channel characterizations. While certain 
assumptions are made in our development, only given classical information will be used 
to supply the corresponding quantum information needed for a complete description of 
the communication system. 

1.3 Nature of Our Theory 

We restrict ourselves to communication systems that are described classically by 
randomly space-time-variant linear channels. We need to develop a quantum descrip- 
tion for such systems, and for this purpose some explicit physical consideration is 
required. We frequently invoke the explicit physical nature of the signals as electro- 
magnetic fields, and regard the "channel" as the medium for field transmission. We 
also make the important assumption that the field propagation is described by linear 
equations. 

A description of communication systems from the viewpoint of classical random 

field propagation is discussed first. To develop the corresponding quantum theory, we 

need to establish certain concepts and results in quantum random processes. A 

development of linear quantum field propagation can then be given. When a classical 

54-59 

channel is specified as a generally random space-time-variant linear filter we 

shall regard its impulse response as the Green's function of a stochastic differential 
equation 5 ^” describing signal transmission. Our theory then gives a quantum descrip- 
tion of such a situation, and can therefore be viewed alternatively as a procedure for 
quantization of linear stochastic systems. Having obtained a quantum specification of 
the channel output field, we shall establish the procedure by which density operator 
representations can be constructed for realistic receiver configurations. 
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A most important purpose of our analysis is to give, under certain assumptions, the 
unique quantum system specification from the usual given classical specification. That 
assumptions are necessary in general should be apparent if we recall that quantum 
classical correspondences are frequently many to one. The utility of such a unique 
quantum classical correspondence is that we do not then need to analyze each communi- 
cation situation anew, and can directly obtain the quantum characterization from the 
classical one without further reference to how the classical characterization was 
obtained. Such an approach is convenient and yields useful quantum models comparable 
to the classical ones. It can be applied without detailed knowledge of quantum theory. 

We shall give further discussions of these points when appropriate. 

1.4 Background 

While the specific theory presented here appears to be novel, it has significant roots 
in both classical random processes and quantum statistics. 

Our system characterizations, for the most, part, are given by state -variable dif- 
ferential equation descriptions as the laws governing field transmission. This mathe- 
matical treatment of classical stochastic systems is well known in the lumped- parameter 
case,^ 0- ^ 5 and can be extended immediately to distributive systems. Similar treatment 
of quantum stochastic systems leans heavily on the works of Lax. ” In Section C we 
give a self-contained development of quantum random processes which is essential for 
our later treatment. To establish the quantum classical correspondence, we also need 
some generalized fluctuation dissipation theorems that will be discussed in the main 
text and in Appendix C. 

We shall employ noncovariant quantum fields throughout our treatment which are 

74 75 

discussed in many places (see, for example, Louisell and Heitler ). A brief descrip- 
tion of the mathematical framework of quantum theory is given in Appendix A. 

1 . 5 Outline of Part I 

In Section B we discuss classical communication from the viewpoint of random-field 
propagation. The system characterization is given in the relatively unusual differential 
equation form, which is suitable for transition to quantum theory. The concept of a 
random Green's function of a stochastic differential equation is introduced. The rela- 
tionship of our description to a more common one is discussed. It should be noted that 
many features of this classical description are retained in the quantum domain. 

In Section C we give a systematic treatment of quantum probabilities and quantum 
stochastic processes. The important notion of a Gaussian quantum process that is fun- 
damental to much of our later discussions is introduced. New consideration is also 
given to the problem of summing independent quantum observables, and to the possibility 
of Karhunen-Lobve expansion for quantum processes. This material may be useful in 
treatments of other quantum statistical problems. 

In Section D we develop the theory of linear quantum field propagation paralleling 
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the classical development of Section B. A general characterization for Gaussian quan- 
tum field is given. The necessity of introducing quantum noise in extending the classical 
treatment to the quantum area is explicitly shown. The general problem of quantum 
classical channel correspondence is formulated and discussed. Under Markovian or 
stationary situations the resulting quantum system characterization is related to the 
classical one through the fluctuation-dissipation theorems, which specify the channel 
output field commutator. 

In Section E a canonical quantum channel representation applying to any transmitter- 
receiver configuration is given. Possible methods for obtaining other representations 
are also discussed. The different resulting representations are considered and com- 
pared from several viewpoints. Emphasis is placed on the flexibility of our procedure 
for achieving convenient models. Generalization of the results to stochastic channels 
is discussed and detailed. Stochastic signals are considered. The entire communica- 
tion system is then treated in a unified manner with a combined representation. 

In Section F we discuss the quantum system models of some typical optical channels. 
The representations of radiative loss and dissipative channels are contrasted and simple 
treatments for the atmospheric and scattering channels are given. An optical trans- 
mission line is also considered from a basic physical description. 

In Section G a detailed summary of .the results of Part I is given. Suggestions are 
made for further work on some outstanding unsolved problems. 
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B. CLASSICAL RANDOM FIELD PROPAGATION 
AND COMMUNICATION SYSTEMS 


We begin our development by considering the theory of classical random field propa- 
gation and the description of communication systems from this viewpoint. The most 
important point is that our quantum analysis will be carried out in a framework exactly 
analogous to the treatment considered here. Our quantum classical channel correspon- 
dence will also be established through the following differential equation representations. 
Furthermore, many features of our present classical description will be preserved in 
the quantum treatment. 

The introduction of a physical field description for communication systems is not 

50 51 78 82 83 84 

new. In classical analysis of optical channels ’ and of reverberations, ’ 

the distributive character of the signals is also considered. Our approach is quite dif- 
ferent from these works, however. 

We shall now start consideration of the channel, by which we mean the medium for 
signal transmission. No modulation and coding will be discussed; instead we consider 
the channel outputs and inputs directly. The information-carrying signals are space - 
time dependent electromagnetic radiation fields that travel from a certain space-time 
region through the medium to a distant region. The channel should therefore be char- 
acterized in terms of the equations that govern electromagnetic field propagation. 

In general, the channel introduces irreversible random transformations on the 
signals. Channel distortion and noise will be included in the dynamical equations 
as random driving forces or random coefficients. Our channel is thus generally a 
space-time dependent stochastic system. Such a characterization can be used to define 
the transition probability in the conventional description, as we shall see eventually. 
Throughout we assume, for simplicity, that depolarization effects of the transmission 
medium can be neglected. Furthermore, we consider only one polarization component 
so that we have a scalar rather than a vector field problem. 

2. 1 Partial Differential Equation Representation of Channels 

Our communication channel is specified by the equations of electromagnetic field 
transmission through a given medium. To give a general description, let us con- 
sider a fundamental scalar field variable i)j(r, t) which can be complex and from which 
the electric and magnetic fields are obtained by linear operations. The precise nature 
of ijj(r , t) does not need to be specified yet. Let the dynamical equation describing 
the channel be of the form 

££ «Mr, t) = E(r, t) + 3F (r, t), (1) 

where ^ is a random space-time varying partial differential operator with respect 
to t and the components of r, E(r,t) is the deterministic excitation, andj^~(r,t) is 
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a random-noise driving field with zero average 
< & (r,t)> = 0. 

We use the vector r to denote collectively the chosen space coordinates, and t is the 
time coordinate. 

When ijj(r,t) is complex we also need to consider the equation 

= E*(F,t)+ (2) 

where .Sf^ is the adjoint of the operator . The star notation means that the complex 
conjugate of the quantity is to be taken. The noise source ^"(r.t) is generally assumed 
to be a Gaussian random field. 

In general, -£f can be a nonlinear random operator. We shall always make the 

important assumption that if is a linear operator. We first consider the case wherein 

if is nonrandom but possibly space -time varying. Stochastic properties of if will be 

introduced later. The channel therefore becomes a spatial-temporal linear filter. All 

85 

relevant quantities are also allowed to be generalized functions including generalized 
86 

random processes, and suitable restrictions are assumed to insure the validity of the 
operations. 

Let the domain of our i|j(r,t) be the set of square integrable functions, 

J v 4M?, t) +(r,t) drdt < 

for integration over the space -time region V of interest. Every such function can be 

87 

expanded in the product form 

+(r.t)= Z <j> k (?)P k (t) (3) 

k 


= Z 
k, n 




(4) 


In order to insure that the distributive system can be conveniently separated into 
an infinite set of lumped parameter systems, or that the method of separation of vari- 
ables can be applied, we let 

^=^ 1+ ^ 2 , (5) 

where if ^ is an ordinary differential operator with respect to t, and if is one with 

respect to the components of r. Both if^ and if^ are presumed to possess a complete 

set of orthonormal eigenfunctions in their respective domain with appropriate boundary 

87 

conditions and definition of inner product. Equation 5 is then equivalent to the condition 
that if possesses eigenfunctions separable in space and time arguments; that is. 
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( 6 ) 


°^\n (r,t) 


0 d> 
Kn v kn 


<r,t) 


4> kn (r,t) = ^k (r) y n (t) ' (7) 

The assumption (5) simplifies analysis without being at the same time a severe restric- 
tion. In fact, in electromagnetic theory, wave equations that mix space and time are 
rarely encountered, if at all. 


2. 1. 1 General Case 


With the decomposition (5) we can generally expand i|j(r,t) in the form (3) with cj> k (r) 
being the normalized eigenfunctions of & ’ for the boundary condition of interest. 

= Vk (?) < 8 >. 

Jy *k< ? > w 2 (?) dr = 6 kk. (9) 


Z 4> k (r) 4»*(r') W 2 (r') = 6(r-r'). (10) 

k ■ 


The spatial region under consideration is denoted by V^. Here 6 kk , is the Kronecker 

delta, and 6(r-r') is the Dirac delta function. The inner product between two eigen- 

87 89 

functions is defined in general with respect to a possible weight function W 2 (r). 

The weight functions are usually required when the solution of the differential equa- 
tion attenuates. In our case they will occur if there is spatial dissipation in the prop- 
agation. Such spatial dissipation will arise when j5f 2 involves odd spatial derivatives 
•in the wave equation of the electric or the magnetic field — a situation that is unlikely 
to occur for electromagnetic field transmission obeying Maxwell's equations. In partic- 
ular, if the loss arises from a conductivity that is only frequency-dependent, odd- 
time rather than space derivatives appear in the wave equation. Therefore we assume 
for convenience throughout our treatment that N 

W 2 (r) =1. (11) 

. A more general discussion relaxing condition (11) is given in Appendix D. 

Since ^~(r,t) is taken to be Gaussian, it is completely specified by the covariances 
_ _ A _ _ 

<^( r,t).F(r't')> = C (rt;r't') (12) 

< JMF.tJJHF't')) = C (rt; r't' ). (13) 

Assuming that every sample function of ^”(r,t) is square -integr able, we can 
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generally expand*^ 

f(F,t)=M k (?)f k (t) (14) 

k 

for a set of nonrandom functions {<t > j c ( r )} defined by (8)-(10) and another set of Gaussian 
stochastic processes In general, the {f^(t)} are mutually dependent with sta- 

tistics specified through (12)-( 13). If we also write 

E(F.t) = Z e k (t) <j> k (F), (15) 

k 

Eq. 1 is reduced to the following set of ordinary differential equations 

(X -k +if 1 ) P k (t)= e k (t)+ f k (t). (16) 

These {(3^(t) } define our field ^(r,t) completely and will be referred to as the time- 
dependent spatial mode amplitudes or simply amplitudes. 

We shall assume in general that the noise source is diagonal in <j>, (r). That is, 

I C^ jr ( M ,, +k (?) V r. 1 , - (17) 

and 

C • (rt;r't') = I (t, f) +.(?) 4>(r'). (18) 

P P k & & K K 

This assumption is discussed briefly in Appendix D. It holds when the noise source is 
spatially white, that is, 6-correlated in space. Under (17) and (18), the {f^(t) } becomes 
statistically independent with 


<f k (t)> = 0 


(19) 

<f k (t)f k ,(t')> = 

•k* '>,<*■ *■> 

(20) 

< f k (t) f k*( t *)> = 

kk' ^-*^-( t , t ' ). 

(21) 


Note that when (20)-(22) are k-independent the noise field ^"(r,t) will be spa- 
tially white. We shall refer to the system (16) with statistics (19)-(21) as our 
"general case," although still more general situations are also discussed in Appen- 
dix D. 

We now proceed to investigate the properties of (16)-(21).. We treat stochastic 
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integrals, etc. , in the mean-square sense. . No attention is paid to strict formal rigor. 

Careful treatments can be found elsewhere. ^ 58,60,91-94 

87 - 89 

Let h^(t,r) be the Green's function 7 of ^ , 

(\ + &\ ] h k (t,T> = 6(t_T) (22) 

with the initial conditions 


3 P h(t, T ) 


at 


p 



p = 0,1,..., n-2 


(23) 


9 n 1 (t , t ) 


at 


n- 1 


t=T , 


& (t) 


when X.. + .Sf, has the form 
k 1 


,n ,n-t 

d . _ d 


\ + *^1 = a o (t) + a j(t) + . . 

k l o dt n i dt n i 


- a „-i <tl a + %<*>• 


(24) 


(25) 


We will assume for simplicity throughout our work that a o (t) = 1. We set 

h k (t,T)=0, t < t. (26) 

95 

This Green's function h k (t,r)is also the zero-state impulse response of the differential 
system described by (16). The zero-state response p^(t ) for an input e k (t), 

(\+- Sf 'i ,f3 k (t) = e k (t)l 


can therefore be written 


P k (t)= h k (t>T) e k (T) dT ’ 
o 


where e, (t) is started from t = t , and the initial state, 
k o’ 


f>k 





t 

o 


dt 


a n_1 

TI P k <« 


t 

o 


(27) 


is taken to vanish. When (27) is not zero we can include them in the differential equa- 

87 

tion (16) as sources by the so-called extended definition of (^+ ) P^ft) for non- 

homogeneous initial conditions. In general, for the form (25) we should enter as sources 
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on the right-hand side of (16) 


n 
Z 
r= 1 


n 

Z 

h'=l 


,, \ Q n'-r., 
a ,(t ) B, (t 

n-n 1 ' o' r k ' 


) 6 r ^t-t ). 
o' ' o' 


( 28 ) 


We have used superscripts to denote derivatives with respect to the argument of the func- 
tion. Thus (28) involves higher derivatives of the delta function. 

The output P k (t) of (16) for arbitrary initial conditions can therefore be written down 
with h^(t,r) alone. 


n n ,r-l 

p (t )= z z (-if ^rjh^t.x) 
r=l n'= 1 dr 


a ,(t ) pP' r (t ) 
n-n' o"k 'o' 


+ J* h k (t, t) e k (r) dr + h k (t, t ) f k (r ) dr. ( 29 ) 

o 

With ( 1 9 )— (2 1 ), the P k (t) are also independent Gaussian processes if the initial distribu- 
tion for (27) is also jointly Gaussian and independent for different k. In general, the 
statistics of (27) are assumed to be independent of those of { f k ( t ) }. 

In many cases, however, it is reasonable to assume that the initial state (27) arises 

from the noise sources f, (r) before the signal is applied at t . Thus if we split the 

K O 

usually non-white additive noise into two parts 

n k (t)= J-oo h k ( t > T > f k (T) dT (30) 


h k (t,r) f k (T) dT + S° K h k (t,T) f k (T) dT, 


(31) 


we can make the replacement 


r= 1 n'= 1 


dr 


,n'-r 


1 


T — t 


(32) 


In this case the first term on the right of (29) can be taken to be zero. We then just need 
to consider 


P k (t) = h k (t,T) e k (r) dT + n k^ 


(33) 


without further reference to initial conditions. 

It is clear, that the output statistics for P k (t) are now fully defined through h^t.T), 
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and the statistics of f k (t) are §i ven by ( 19 )-( 21 ). We can also form an arbitrary set of 
linear functionals of {p^(t)} which will be jointly Gaussian with statistics determined 
accordingly. 

It is important to point out that the noise source ^"(r,t) or f^(t) in our differential 
equation description is a thermal noise associated with the filter system. It is possible 
to have other independent noise added to >Jj(r,t) or (3 k (t). Noises from different sources 
can clearly be treated together in a straightforward way. 

2. 1. 2 Markov Case 

With a particular choice of 4 J (r' , t ) it may be possible under some approximations 
to have 

C^^t.t') = 2K k (t) 6(t-t«) (34) 

C k . (t, t<) - 2K k (t) 6(t-t') (35) 

jr * 

for the corresponding noise source ^"(r,t). In this case the {f, (t) } become inde- 

6 0 65 

pendent white noises so that each P k (t) is a component of a vector Markov process. 

This Markov vector process is formed by p k (t) and its higher derivatives. 

With the same approximation that leads to (34) and (35) one frequently also finds 
that J*? j only involves first time derivatives. Thus P k (t) becomes a Markov process 
by itself. For simplicity of presentation, we shall mainly consider this case instead 
of the vector Markov one. In Appendix B the vector Markov case is treated. As 
we only look at the variables {p^(t)}and their complex conjugates, the vector Markov 
case leads to results that are also similar to those obtained in the strict Markov case. 
This point is made explicit in Appendix B. 

We therefore consider the first-order differential equation for each k. 


(X k +if 1 )P k (t)= e k (t)+ f k (t) (36) 

<f k (t))=0 (37) 

<f k (t) f k ,(f» = 6 kk , 2K k (t) 6(t— t') (38) 

<f k (t) f k ,(t')> = 6 kk ,2K k (t) 6(t-t'). (39) 


k k 

The functions K^t) and K,,(t) are commonly called diffusion coefficients . In such 
a representation the P k (t) are frequently complex so that we use the following 
vector and matrix notations when convenient. 
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and write 

( k k + ^i )(3 k (t) = £ k (t) + 4 (t) - (40) 

For this complex P k (t) case it is more appropriate to consider {P k (t), P k (t)} as a jointly 

Markov process. To distinguish from the ^vector Markov case discussed above, we shall 

not refer to P k (t) as a Markov vector process. 

We shall now give a brief quantitative development that will be used in our later 

work. Further details may be found, for example, in the work of several authors. ^"^5 

6 ) 2 

In the present development, we follow closely Helstrom, but also derive some other 

results of importance to us. For our situation of interest it is more convenient to adopt 

the Langevin-Stratonovich^ 0 ’^’^'’ or Ito^ viewpoints rather than the Fokker -Planck - 

Kolmogorov^’ ^ one> They are fully equivalent,^ 4 in our case, however. 

9 5 

We define the state transition matrix, h k (t,-r) of Eq. 40, by 

(X.+ifMh, (t,r) = 0, t > t (41) 

1 fv K 

under the initial condition 

\(T,T)=I. (42) 

Here 
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and, for simplicity, we have taken the coefficient of d/dt in ’ to be unity. This transi- 


tion matrix h, (t,r) is then 

~K 


£k (t ’ T) = 


h k (t,r) 


(43) 


h k (t>T) y 


where, for t > t, h k (t,-r) is the same as the zero-state impulse response of (36). 
We define the additive noise vector 

2 k (t >= fl„ ik (t ’ T) 4 (T) dT 


(44) 


= L°oo h k (t,T)i k (r) dr + n^t.t). 


(45) 


where the signals e k (t) are again assumed to be turned on at t = t. We can write as 
a particular case of ( 29 ) 


£k (,, = !!k (t ' i o , iyy + s k (*.u«k(T)drt a yt,t ). 

O 

The conditional variance 

» k <‘.y = tk <t ’ T| -k (T)dT l 


(46) 


X [£k (t)- !ik (t ’ t o ) £k* t) "' ^t ~k^ t,T) ?k (r)dT ^ T ^ 


(47) 


is therefore 


Jk<‘- t „» = <Sk< t ’ t o | 2P*. t „> >■ 


(48) 


where T denotes the transpose of a matrix 


( a L = (a)--. 
~ ij ~ J 1 


If we further define the covariance 


£ k (t,t«) - <[P k (t) ~ ^( t . T )£k (T)dT ][Pk (t,) ‘ ! l !ik( t,T )£k< T)dT ] T )’ (49) 


we have 

£ k (t,t') = h k (t ’ t,) ik (t '’ t ' )l 


t > t* 


(50) 
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t > t' . 


(51) 


*k(t. f ) = ^ k (t, t) - h k (t, f ) £ k (t' , V ) h k (t, f ), 
It is also convenient to set 

<f k (t)f k (t')> = 2D fc (t) 6(t-t') 

with 




Kj(t) 

K 2<‘) 

K 2<*> 

Kf(t) 


so that 


t > t'. 


* k (t,t') ='2 /J, h k (t,r) D k (r) h k (t, T ) dr, 

We next assume that the system is stable. That is, 


lim h k (t, t' ) = 0. 
t' — -oo ~ 


From (47) and (55) we therefore have 
$ k (M) = £k (t,_00) 


(52) 


(53) 


(54) 


(55) 


= 2 /^ oo h k (t,T)p k (T)hJ{t, T ) dr. (56) 

As discussed in the general case, we see from (56) that in this Markov case the vari- 
ance i k (t t ) arises from the random force f k (t) for a given D k (t). The initial sta- 
tistics of P k (t) can therefore be specified as a Gaussian distribution with zero mean and 
variance 



Now it can be readily shown from Eqs. 51, 52, 45, and 57 that 

£ k (M) = <Iik (t) -k (t) ' > - 

Together with (50) this justifies the representation 

£ k (t)=/J ^k (T) dT + Sk (t) 


(57) 


(58) 


(59) 
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without further reference to the initial condition (3, (t ). 

In any case, it is most important to observe that the statistics of the process P k (t) 
are completely specified by h k (t,r) and D k (t), or equivalently by h k (t,x) and <|> k (t,t). By 
differentiating (56), we arrive at 

^£k (t,t) = 2 Pk (t,_ ^k (t, 4k (t - t) -ik (t ’ t) ^k (t) (60) 

if we write 

<ik + £1 > tk<‘> - [l m* £k<*>] &<*> = o- " » 

97 

Equation 60 is a special case of the well-studied matrix Ricatti equation. We call (50) 
and (60) the fluctuation-dissipation theorems for the process P k (t). 

From our viewpoint, the substance of a fluctuation-dissipation theorem is to relate 
one- and two-time statistics of a process in a simple, convenient, but nontrival way. 
Such a relation can intuitively be seen to exist for a Markov process (or a component 
of a Markov process obeying a different equation with white driving noise). If A k (t) or 
h, (t,r) is interpreted as dissipative, we understand why the theorem connects dissipa- 

<vK 

tion to the fluctuation. Thus given the impulse response h k (t, r), we only need 
to know the one-time <|> k (t, t) or D k (t) to give the two-time covariance ^.(t.x) and 
( l k (t) fj^(t' )). When the system is time -invariant, in that 

A k (t) a A k independent of time 

with a stationary driving force 

D k (t) = D k independent of time, 

we see from (56) that <j> k (t,t) is independent of t and the process P k (t) becomes also sta- 
tionary. The statistics in this equation is even specified by just h k (t— r) and a con- 
stant D k or A k * 

Under our Gaussian assumption the fluctuation-dissipation theorems allow us to 
specify the complete process by the mean response h k (t,-r) and the one-time behavior 
<j> (t , t). In our later treatment of quantum-classical system correspondence, the one- 
time classical behavior will be connected with the quantum behavior system from 
thermal-noise representations. With fluctuation-dissipation theorems of this nature we 
shall then have also established a complete correspondence. 

Although we can write down the transition probability 

hXwiPk'Vo) 

which defines completely the Markov processes, such explicit equations and other 
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details are omitted here because they can be obtained straightforwardly in case we need 
them. 

State -variable Markov process representations have been used for communication 
98-103 

application before. In contrast to previous cases, we use Markov processes 

strictly for channel representations. Furthermore, we attach physical interpreta- 
tions to these representations as the equations derived from basic laws of physics that 
govern field transmission. 

2. 1. 3 Stationary Case 

It may occasionally be unsatisfactory to use a Markov approximation like the one 
discussed above. In this case the force yt) cannot be taken to be white at all. In gen- 
eral, there will then be no fluctuation-dissipation theorems for an arbitrary Gaussian 
process. For the particular case of a stationary system, however, such theorems 
do exist 1 04-107, 70 anc j be described below. 

Let the equation for p ) be 

(\ k +-^ 1 )P k (t)= f k (t), (62) 

with 

<f k (t)> = 0 (63) 

<f k (t) yt')> = 6 kk ,C k (t-t<). (64) 

Here we have taken p k (t) to be real, since it is more appropriate to consider directly 
the electric and magnetic fields in such situations. The driving noise source yt) is 
stationary, and is assumed to be time -invariant. We can then write (62)in the Fourier 
representation 

<S ? k (w) P k (co) = f k (w) - (65) 

where 

A(oo) = fZo e 1 ^ A (t) dt , • ( 66 > 

for a process A(t). We also define 

<A(w)B*(«)> = Cdte-^<A(0, B *(t,) (67) 

for any two processes A(t) and B(t). 

th 

If we now take P k (t) to be the electric-field amplitude for the k mode and assume 
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that the fields are in thermal equilibrium with an environment at temperature T, we 
have 


2k T I 

<f*( w )f k M> = <\(o»)f*(«)> = 


k 1 


( 68 ) 


- 2k T 

<P k ( W )P k (co)) = ~ Im 




(69) 


Here k-, is Boltzmann's constant, and (£ (o>) is the imaginary part of (go), 


se k M = + iif^co). 

Thus the correlation of f^(t ) is determined completely by X_ k + ^ and the system tem- 

perature T. The interpretation of Eqs. 68 and 69 as a fluctuation-dissipation theo- 
rem is obvious. 

The utility of such a theorem for our research has already been discussed. In 
our later quantum treatment we shall further elaborate on the nature of Eqs. 68 and 
69 and its application to our problem. 

Fluctuation-dissipation theorems for fields in all of our cases can be obtained 
by combining the results for mode amplitudes in a series expansion. They will be dis- 
cussed in Section I-D. 

93 

Other classes of random processes, for example, martingales, also admit two- 
time statistical characterization by one-time informations. It is more appropriate, 
however, to consider the problem from a physical Hamiltonian point of view. Such con- 
sideration will be touched upon in discussing quantum development and in Appendix C. 

It should again be emphasized, before we leave the differential equation characteri- 
zation, that our driving noise source is always the thermal noise associated with the 
system. Other noise is presumably additive to the fields ^(r, t). 


2. 2 Nondifferential Filter Channels 

It is possible that in a given specification of a channel in terms of a space -time filter 
the system cannot be interpreted as a differential one. We use the terminology "nondif- 
ferential filter" to indicate for certainty that a corresponding differential equation does 

95 

not exist, in contrast to some previous usage. Although realization theories of linear 

1 08 

dynamical systems do exist, they do not seem to be directly applicable to our situa- 
tions. The difficulty is that we do not know, strictly speaking, the order of the differ- 
ential equation representing our system. In many cases a nondifferential system whose 

impulse response is a reasonably well-behaved function can be approximated arbitrarily 

95 

closely, in the sense of zero-state equivalence, by a differential system of sufficiently 
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high order. Clearly, there exist filters that do not admit of a differential representa- 
tion. A case of frequent occurrence is the multiplicative situation 

h(t,x) = A(t) 6(t-T ) 


or 

G(rt; r'-r) = A(rt) 6(t— r ) 6(r-r'). 

The noise is then usually specified by an additive component N(r, t), 

4<(rt) = / G(rt; r *t * ) E(r' , t' ) dr'dt' + N(r, t) (70) 

with excitation E(r',t'). In this case it is more appropriate to consider the channel input 
and output as related by 

if/(rt) = A(rt) / G f (rt;r't') E(r',t') dr'dt'. 

That is, the input field after propagation over a space-time filter described by G^rtjr't') 
is multiplied at the output by A(r , t). The question then becomes whether 

G(rt; r't' ) = A(r,t) G f (rt;.r't') 

can be interpreted as a Green's function of a partial differential equation, if we suppose 
that Gj(rt;r't') can be. Further consideration of this will be given later. 

2. 3 System Normal and Noise Normal Modes 

It is now convenient to introduce the concept of system normal modes and noise nor- 

r) of ^2 as system space-normal 

modes and eigenfunctions y (t) of as the system time-normal modes. The product 

<j> (r)y (t) are the space-time normal modes. These system normal modes can be con- 
k n 

trasted with the noise normal modes <j>' (r) and y' (t). Here the Gaussian noise source 

k n 

is expanded as 


mal modes. We refer to the eigenfunctions (j>^( 


J r (?,t)= Z f icn <f> k (r) yn (t) (71) 

kn 

= Z *k (? ) f k (t >, (?2) 

k 

where f, are statistically independent random variables, and f (t) are independent ran- 
kn a go 

dom processes. Equations 71 and 72 are Karhunen-Lofeve type expansions for a' 

Gaussian random field with square -integr able sample functions. This system and noise 
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normal mode terminology would occasionally be abbreviated as "system" and "noise" 
modes. 

The statistical dynamical problem is completely diagonalized if the system normal 
modes coincide with the noise normal modes. When they differ the use of system normal 
modes implies that the noise components for them are not independent, and the use of 
noise modes implies that these modes are coupled. In case * s dissipative, a white 
driving noise field ^~(r,t) expanded in the system normal modes, 

f(?,t)= Z ^ k (?)f k (t), 
k 

would give rise to statistically dependent f^(t). If we use noise modes 4>M?), we would 
obtain a linear system of coupled differential equations for with independent 

driving processes {f^(t)}. A particular choice of simplicity can be based on individual 
problems and individual questions. Our assumption, Eq. 1 1 , permits our system and 

noise normal modes to be the same even when ^”(r,t) is white. 

87-89 

The Green's function for the partial differential equation " Eq. 1 with the condi- 
tion Eq. 5, the boundary conditions of ^(r), and vanishing initial conditions can be 
written in general 


G(rt; r't') 


l ;r-bcr' |, k (F| ' + k (f,,w 2 (F ' 1 *!<*'»■ 

nk n k 


(73) 


where 



y (t) = v y (t) 

J n' n n 


(74) 


( v 1 V% |,|w iW d J , = i tf < 75 >. 

2 y*(t)y (f) w i(t')= 6 (t-t'). (76) 

n 

In this case is the time interval of the problem, and W^(t) is generally not unity. The 
additive noise field 

F(r , t) = / G(rt; r't') jF<r' , t< ) dr' dt (77) 

corresponding to a white driving noise source •^’(r'.t 1 ) then also possesses normal modes 

different from ck (r) and y (t) in general. 
k n 

The discussion on system and noise normal modes that we have just given carries 
over straightforwardly in the quantum treatment. 
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2. 4 Stochastic Channels 


In a nondifferential filter characterization of channels, the filter can be taken 
as a random process. For example, in the equation 

X(t) = / h(t,T) e(r) dr + n(t) (78) 

we can specify the two-dimensional process h(t,-r), and independently the noise process 

4 109 

n(t). Such channels have been the subject of much study, ’ and many different useful 

54 55 4 

characterizations are available. ’ ’ 

In the case of differential equation representation, we are considering a stochastic 
differential equation 

(i?’ 1 + ^ 2 ) «Mr,t) = E(r,t) + (r,t) (79) 

whose -Sf, and Jzf. are now linear random operators. The study of such an operator is 

* 56-59 

a relatively difficult subject, and very few analytical results are available. Var- 

ious approximations usually have to be made. 

2. 4. 1 Random Green's Function 

For our purpose, it is convenient to introduce, parallel to the nonrandom case, the 
concept of a random Green's function. By this we mean that under the deterministic 
boundary condition prescribed previously, the solution +(r,t) of (79) can be written 

Wr.t) = / y G k (rt;r't') [E(r't')+^(r',t')] dF'dt' (80) 

for a four-dimensional random field 

G R (?t;?'t') (81) 

which we call the random or stochastic Green's function of (79). Thus G T3 (rt; r't') is 
the inverse of the random operator 3 ? . Note that the term stochastic Green's func- 
tion has been used before with a totally different meaning. 110 

The crux of the statistical problem is then of course the determination of properties 
of G R (rt; r"t')from (79) under various mathematically specified conditions. Such a task 
appears to be quite difficult for even a simple equation (79). It is not clear that such 
an approach to Eq. 79 is the most fruitful one in general. Our discussion of a communi- 
cation system would be greatly simplified, however, to a level comparable to the non- 
differential case, if we had such a random Green's function that might be obtained 
from various approximations. We shall see immediately that G R (rt; r't' ) is at least 
a powerful theoretical tool in our communication system analysis. 
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Similar to the deterministic case, we have the problem of realizing an integral std- 
chastic channel representation by a stochastic partial differential equation or random 
Green's function. The difficulty here is more severe; the deterministic case and further 
approximations will usually be required. 

2. 4. 2 Stochastic Normal Modes 

We now assume that a stochastic Green’s function of the kind discussed above has 
been given which specifies the channel output in the absence of other noises for a fixed 
input signal. To avoid part of the difficulty in connection with stochastic differential 
equations mentioned above, we can regard as given a random field (81) which is the sto- 
chastic Green's function of a certain random differential equation. We are not able to 
tell whether this can indeed happen for a given G^(rt; r’t'). For our interest this dif- 
ficulty should not be too serious. 

We can formally expand the random Green's function of (81) in the form 

G R (rt; r't') = Z ^(r) «£(?') ^(t, t') (82) 

k 

for a set of orthonormal functions 4> k (r) and a set of random processes h^(t,t'). The 
expansion (82) is equivalent to the assertion that the possibly random possesses 
orthonormal eigenfunctions <(> k (r) with random eigenvalues. If is nonrandom, (82) 
is clearly valid. The process h^t.t') can be expanded as before: 

»*<*•*'>= E *„<*’> W l (t '> (83) 

mn 

for functions y (t) obeying Eqs. 75 and 76 and random variables {g^ ln }- The set { z m (t)} 
is another sequence of orthonormal functions. The stochastic Green's function can then 
be written in a spectral representation 

G R (rt;r't»)= Z gjl 2 (t) y (f) <f> k (F) «£(r') W^f). (84) 

kmn 

In general G„(rt; r't') can therefore be conveniently specified by the joint distribution 

of{g k } 
lb mn J 

Let us define the mean and covariance of h^(t,t') by 


h k (t,-r) = h k (t, T ) 


(85) 


h k (t,r)-h^(t,T) 


[h k (r,s)-h k (r, s)] 


C k * (tr; ts) 
h h 


( 86 ) 


22 



although higher correlations may also exist in general. We have used the bar to indicate 
stochastic channel averaging, in distinction to the angular bracket notation for noise 
averaging. It is frequently possible to set 

C k * Jtrirs) = c£*(tr;TS) = 0. (88) 

hV hh 

In such a case the expansion (84) can be taken as a Karhunen-Lobve expansion with 
uncorrelated {g^ n } for different {m,n}. This possibility is evident if z m (t) Y n (t' ) is an 
eigenfunction of the integral equation 

/ C k „ (tr; ts) z (r) y (s) dTds = a k z (t) y (r). (89) 

h h 

r k n 

If (86) is nonvanishing, it is generally not possible for {g mn /f° be uncorrelated. That 
is, it is not possible for 


z k* k , 

(gm'n' g mn^ = 0 

m =£ m', 

, n 4 n 

<g m 'n' g mn^ = 0 

m m', 

, n * n 1 


to hold together, since we are effectively trying to diagonalize two different processes 
14 

simultaneously. If h^(t,t') is real, such an expansion would always be possible. We 
shall refer to such z m (t) and y n (t) as the stochastic normal modes, to distinguish from 
the previous nonrandom system normal modes. 

When h, (t,r) or G (rtjr't 1 ) is Gaussian, Eqs. 86 and 87 become a complete specifi- 

R K r k i 

cation of the random Green's function. Furthermore, when (88) holds, l£ mn / become 
independently Gaussian random variables with mean 


’mn 


and variances 


k* k k 

P S “CL 

to mnmn mn 


g 


k g k 
mn mn 


0. 


The representation (80) is then in the convenient diversity form 


( 90 ) • 


*H r . t) = 2 g^ n <j> k (?) z m (t) / y n (t') ^(r') dr'dt' [Efr'tM+^r'.t')] 

kmn 

so that the normal modes z (t), y (t), and <k (r) truly occupy a central role. It can 

rci n k. 

be straightforwardly shown in this case that the two noise fields 


Z 

kmn 


^mn ®mn 


) <M r > 


z m (t) ^ y n (t,) < M r ' ) dr ' dt ' 


E(r\ t' ) 


and 


Z 

kmn 


g 4>, (r) z 
6 mti T k' m 


(t) /\ 


/y y n (t*) 4> k (r') dr'df ^(r'.t-) 


■ V 


are independent. Note that the stochastic channel also filters the noise source field 
JHr.t). 

r k i 

We call a diversity representation of the form (90) with independent iS mn } a canoni- 
cal diversity representation, since it diagonalizes the problem for any signal excita- 
tion. If available, it is more useful than diversity representations based on specific 

4 

signal sets as channel representations, for it relates the input and output directly. 

In this Gaussian case we shall frequently not need the explicit construction (84) for 

many applications. Instead a direct characterization of its mean and covariance suffices. 

We mention again that our random Green's function would generally be regarded to 

k 

be specified by (84) with joint distribution on {g }. 


2. 5 Stochastic Signals 


Stochastic signals are easily treated in the nondifferential case (69) with either a non- 
random or a stochastic channel. We need only specify the signal process completely, 
which is always assumed to be independent of the channel and the noise statistics. For 
example, we can choose to expand 

E (r,t) = Z e kn 4> k (r ) y n (t) 
kn 


in a Karhunen-Lofeve expansion when possible, and then specify the joint statistics of 
{e kn }. Other specifications are also possible. 

In the case of differential equation representation, E(r,t) can again be specified in ' 
whatever form is convenient. Since it is an independent input excitation field, no special 
difficulties in its characterization arise as in the stochastic channel case. Furthermore, 
neither the noise source nor the additive noise field are influenced by the stochastic 
nature of E(r , t). 
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2. 6 Relation to Ordinary Filter Description 


Usually a communication system is characterized in the black-box form of Fig. 1 
for an additive Gaussian noise n(t) and a randomly time-variant linear filter h^(t,T). 



n(t) 

Fig. 1. Randomly time-variant linear filter channel. 


This description suppresses the physical aspects of the system. In particular, the 
space coordinates cannot yet be identified. 

A nondifferential distributive description like the one discussed in section 2. 2 can 
be represented in the form of Fig. 2, wherein G^(rt;r' t' ) can also be random. 



N (F , t) 

Fig. 2. Randomly time-variant nondifferential linear 
distributive channel. 


This case can still be considered a special case of the following differential system 
representation if G^frtir't') can be interpreted as the stochastic Green's function 
of a random differential equation. In this case we have Fig. 3. 



(r r 0 N (r , t) 

Fig. 3. Randomly time -variant differential linear 
distributive channel. 
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In the previous discussions we have only treated the output 4>(r,t). A specified addi- 
tive noise field N(r,t) can evidently be introduced with a combined representation 

4i'(?,t)= /‘oo G R ( ?t ; F ' t, )[E(? , ,t')+^ r (? , ,t')] dF'dt' + N(F.t). (91) 

The initial conditions have been suppressed as explained previously. 

It is simple to give the representation of Fig. 1, 

P(t) = / h(t,r) s(t) dr + n(t), (92) 

from the distributive representation ( 91 ) when we know that 

P(t) = / v «!»•(?, t) u(r) dr. (93) 

u 

Such a relation is indeed what usually occurs, say, when we look at a coherence area 
on the received plane of an optical channel. If in this case the signal s(t) is generated 
by a point source at r = 0 

E(r,t) = 6(t) 6(r), 
we have 

h(t , t ) = f y G(rt;r't') u(r) dr'dt' 

v u 

n(t) = fy fy dtdrdr' G(rt; r't' ) u(r ) ^(r 1 , t' ) + fy u(r)N(r,t) dr. 
v 2 u v u 

It is obvious that it is not generally possible to obtain (91 ) from the representation 
(92) even if we know that (3{t) is obtained through iHr.t) with a given u(r), as in (93) 
This should not bother us, since only the representation (91) is a complete specification 
of the situation under consideration. We shall always regard the classical specification 
to be given only if each function or process on the right side of (91) is specified. We 
require such complete specification even when we are ultimately interested only in ( 92 ). 
This is because physical aspects need to be explicitly invoked in the quantum treatment; 
for example, the nature of the variable P(t) is involved. 

When a complete specification is given in the form (91) it is clear that we can form 
still more general communication systems than the one shown in Fig. 1. If we let 

a k = f ^'(r.t') £ k (r,t») drdt* 

for ordinary functions {£, k (r,t)}, we can determine the joint distribution of {a^.a^j-in 
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a straightforward fashion. When 


! k (r,t«) = u(r) 6(t-t*) ( 94 ) 

we would recover the system (92), in that a = (3(t). In general, a choice of {a, } reflects 

IV 

to a certain extent the physical receiver structure or configuration. 

2. 7 Conclusion 

We have developed the theory of classical random field propagation in a particular 
form convenient for translation to quantum treatment. A description of classical com- 
munication systems from this framework has also been given. The novel feature in our 
discussion is that the differential equation channel characterization is quite physical, 
in that it describes the field transmission in the system of interest and is related rather 
intimately to a fundamental Hamiltonian treatment. These points will be discussed fur- 
ther in the quantum case. 

Another apparently new concept that we have introduced is the notion of a random 
Green's function which is particularly important when we use a differential equation phys- 
ical description. When available, it provides complete information on the solutions of 
a stochastic differential equation. It should be worthwhile to investigate such functions 
further because they would have applications in many other areas. 

Finally, we would like to mention that the description of communication systems by 
differential equations, together with appropriate physical interpretations, . should provide 
a useful approach to general communication analysis. 
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C. QUANTUM RANDOM PROCESSES 


We shall now develop an operational theory of quantum random processes to a degree 

sufficient for our future purposes. We shall use the fundamental results established 

here to obtain quantum- channel representations. Rigorous mathematical discussions of 

91-94 

ordinary stochastic processes may be found in many places. General mathematical 

formulations of quantum dynamical theory, . with due regard to the statistical nature 
peculiar to quantum mechanics, also exist in a variety of forms. ’ ’ ” The most 

common form is briefly reviewed in Appendix A. There does not exist, to the author's 
knowledge, any systematic and convenient mathematical theory of random processes 
applicable to situations in quantum physics, although there are fragmentary works both 
of a mathematical* anc j a ca x cu Xa.t ional nature. i n addition, some proba- 

bilistic notions and techniques related to quantum statistical dynamical problems have 
been used by physicists. ^ 73, 118 120 ^ S pite of this, it is highly desirable, at least 
for applications to communication and other systems, to have a common framework for 
treating quantum random problems that is comparable in scope to the discussion of 
classical stochastic problems by ordinary random processes. It is our purpose to 
sketch a primitive version of such a novel theory. 

By a quantum random process we mean a time-dependent linear operator X(t), which 
is defined on the state space of the quantum system under consideration and possesses 
a complete set of eigenstates for every t. A quantum random process, which we often 
abbreviate as a quantum process , is therefore a quantum observable in the Heisenberg 
(or H) picture. See Appendix A for more detailed discussion. We shall first discuss 
time- independent operators and then we treat the time-dependent case. 

It is appropriate to emphasize, first, the differences between our present treatment 
and that of ordinary stochastic processes. In the classical case the system under con- 
sideration, composed of functions f(X) of a random variable X, is completely character- 
ized by the distribution function of X alone. In quantum theory we cannot obtain the dis- 
tribution of an f (X) from that of X in the classical manner when X is non-Hermitian. 
Complete statistical characterization of a quantum system is given differently, usually 
through a density operator. The purpose of our development is to establish convenient 
statistical specifications of a quantum system, while maintaining, as much as possible, 
the applicability and usefulness of ordinary stochastic concepts and methods. Similarly 
to the classical case, such concepts and methods make possible the efficient analysis 
of many quantum statistical problems. 

3. 1 Quantum Probabilistic Theory 

We have to develop some probabilistic concepts applicable to the quantum case before 
we can start our discussion of quantum processes. Appendix A gives a brief introduc- 
tion to quantum formalism and to some of the definitions that we use. The essential 
point of our treatment is to give a c-number description of the quantum observables 
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"X 

under consideration, so that many ordinary stochastic concepts and techniques can be 
transferred to the quantum domain through this c-number characterization. 

3.1.1 Self-adjoint Quantum Variables 

We shall consider a set of commuting self-adjoint quantum observables {x^} of a 
given quantum system. We are not considering a dynamical situation so that the are 
time- independent in any picture. 

As explained in Appendix A, the quantum system under consideration is completely 
specified by its mixed state represented by a density operator p. Let | x^. . .x.. . .) be 
the simultaneous eigenvectors of {30}, with eigenvalues {x..}. The results of simulta- 
neous 30 measurements are therefore distributed with a probability density 

P(Xj. • -x.. . . ) = <x r . . x.. . . | p |x r . .x.. . . ). (95) 

When the eigenvalue set {x^} is degenerate, the distribution is modified to read 

p(x r ; .x.. . . ) = tr. PP{ X .}- (96 ) 

where P{ x } i- s the projection operator for the eigensubspace corresponding to the eigen- 
values {x.}. We shall often use the term distribution instead of probability density for 
brevity, a common practice among physicists. No confusion should arise, since we 
never consider true probability distributions, that is, the integrated probability densi- 
ties. 

As the distribution (95) or (96) is indeed a joint density for {x^}, marginal and con- 
ditional densities can be defined as usual with corresponding physical interpretations. 
We shall not pursue such a development here. 

It is important to observe that the distribution (95) or (96) does not specify p com- 
pletely in general. If we were interested only in the observables X^> or functions 
of them, (95) would provide sufficient information because the off-diagonal elements, 

<x r . .x.. ,. | p | x’j . . . xL ..) x i #xV, 

would not then play a role in the problems of interest. In this case the system density 
operator can then be considered to be effectively diagonal in the |xj. . .x.. . .) represen- 
tation. The quantum statistical problem becomes a completely classical one, given the 
distribution of (95). 

3.1.2 Non-Hermitian Quantum Observables 

Our interest in this work concentrates on systems whose observables are functions 

t + 

of b^, bj, where for each i, b^ and bj are the photon annihilation and creation operators. 
Our attention hereafter will be directed only toward such sets of operators. 
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( 97 ) 




,t 


Let us first consider the case where we have only one set b, b , with 
[b, b T ] = 1. 

A brief discussion of these boson operators is given in Appendix A. If we write 


, b + b^ . ( b - b^ , 

b = 2 + 1 \ 2i / b l + lb 2 

for self-adjoint operators bj and b^, we see that 
[ b T b 2 ] = Y • 


(98) 


Thus we have a situation in which the system observables of interest are functions of 
non-Hermitian operators b and b^" or of two noncommuting self-adjoint operators b^ 

and b, — a case different from that of the previous section. 

z t 121 

The operator b' has no eigenvectors except the null state. In contrast, b has 

an overcomplete ^ 22 set of eigenstates |p) with complex eigenvalues p. 11 ^ 121 

b| P) = p|p) (99a) 



(99b) 


d 2 p = d(Re p) d(Im P). 

These coherent states | p) are nonorthogonal 1 123 

<P I P' > = exp{pV - | p| 2 - - | p' | 2 } 


(99c) 


(100a) 


but properly normalized 

<p| p> = 1. (100b) 

Arbitrary functions of b and b^, the observables of interest to us, can be written 

74 124 t 

in different operator orders. * A function f(b, b' ) is said to be in normal order if 

t 74 

every b stands to the right of every b 1 . We write 

f(b, b^) = f (n) (b, b*) (101a) 

to indicate that an observable has been written in normal order. If we now replace b 
and b' in f v '(b, b 1 ) with p and p , two c-number complex variables, we have an ordi- 
nary function of two complex variables 

f (n) (p, p*) (101b) 


30 


where the bar on f reminds us that f is a c-number function. The variables p and p* 
will be called the associated classical amplitudes of b and b^, and f^(p, p*) will be 
called the associated classical function of f^(b, b^). 1 25 Given (101b). we can clearlv 

ji- J. • 

recover (101a) merely by replacing (P, p ) with (b, b' ) and write the resulting function 

in normal order. This correspondence between normally ordered operators and their 

74 125 

associated classical function can be expressed ’ as 


f (n) (b, b*) = n{f (n) (p,p*)} (101c) 

with the introduction of a normal ordering operation n. 

Similarly, we define a function f(b, b^) as in antinormal order 

f(b, bt) = f (a) (b, b*) (10 Id) 

when each b^ stands to the right of every b. Analogous associated amplitudes and asso 
ciated classical functions can be introduced so that 

f (a) (b, b 1 ") = J3/{f (a) (p,p*)} 

for an antinormal ordering operation si . Further discussions of operator orderings 

, , ,74, 124, 125 

may be found. 

It has been suggested 11 ^’ 12 & -128 that for a broad class of functions f(b,b^), we can 
expand 


f(b,b t ) 


■y 


-(a) * 

f (P. P ) 


d 2 p 

PXPl — 


( 1 0 1 e ) 


The precise conditions of validity for (lOld), a subject of much discussion and con- 
troversy,^ 2 ^’ 12 ^’ need not concern us here. In all of our applications its validity 
can be established. 

Applying (lOle) to the system density operator p yields 

P = / P(P, P*)|p> <P| d 2 p, (102a) 

where the function P(P, p ) 


P(P,P>jP (a) (P,p") (102b) 

119 

is commonly called the P- representation of p. The possibility of such a diagonal 
expansion of p rests on the overcompleteness of the eigenstates | p). Note that p is 
not actually diagonal in the | p ^representation because 

<p| P |p-> = o P*P' 
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121 

implies that p = 0. 

Because all observables of the system are functions of b and b^\ we can evaluate 
averages of f^(b, b^) with the representation (102a) in the form 1 

< f(b, b^)> = <f (n) (b, b^)) = tr. P (a) f (n) (b, b*) 

(103a) 

= / P(P.P )f W (P,P ) d^p. 

Similarly, we have 12 ^ 

< f(b, b^)> = <f (a) (b, b^)> = tr. P (n) f (a) (b, b*) 

2 (103b) 

= / P(P, P*) f (a) (f3,P*) 

where 

p(P, P*) = <P|p|P> (103c) 

are the diagonal elements of P in the coherent representation | p). 

3.1.3 Quasi Densities and Characteristic Functions 

3 9 

The expression (103c) has been shown (see also Appendix E) to be the probability 
density describing the outcome p of quantum measurements of b. Of course (103c) then 

possesses^all of the usual properties of a probability density. Also, it is an analytic 

* 

function of two complex variables P and P , and, therefore, is a very well-behaved 

121 * 
function. For the same reason, P is also completely specified by P (P, P ). 

In contrast, the P-representation (102b) is not an ordinary probability density. It 

can become negative and quite singular, involving an infinite sum of arbitrarily high- 

127 120 

order derivatives of the delta function. ’ The general usefulness of P(P, P ) rests 
on the fact that normally ordered averages can be computed in a classical fashion (103a), 

as if P(P, P ) were a probability density. Furthermore, the density operation P is also 

❖ 

completely specified by P(P, P ), as is evident from (102a). In our later applications 

«ju 

*»*■ 

P(P, p ) will usually be positive and nonsingular, obeying many mathematical properties 
of a density function. It is still not interpretable, however, as a distribution describing 
outcome probabilities of certain quantum measurements. 

•A. 

•V 

Our purpose is to find c-number variables (p, p ) with corresponding given 'quasi- 
density' functions such that all quantum information of the system under consideration 
can be obtained. We are therefore asking for a convenient c-number characterization 
of the quantum system so that the classical stochastic concepts and methods may be 
carried over. 

We refer to P (P, P ) and P(P, P ) as quasi densities because their role in many 
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applications is similar to that of probability densities in ordinary random problems. The 
c-number variables (p, (3 ) are called quasi-random variables or quasi variables accord- 

S»C 

ingly. Note, in fact, thatp((3,|3 ) is a true density. It is convenient to lump it with 
P((3, (3 ) for our purposes. 

To emphasize that the density operator p is really needed to describe outcome 

distributions for measurements of f(b,b^), we observe that we cannot calculate the 

1 " * 

distribution of, say, b'b from that of p((3, (3 ) by making a classical random variable 
transformation. This is an intrinsic quantum property that distinguishes quantum and 
classical statistical elements. 

Since averages of f(b, b^) can be calculated by different kinds of orderings of b and 
b^ as in Eqs. 103a and 103b, we find it convenient to define several characteristic func- 
tions. 1 0 Specifically, 

4> N (ib M*) = tr. {pe^ e -li b } (104) 

<t> A (hb H*) = tr. {pe ^ he^ }. (105) 

Here 4 , nN j(l JL . H*) and (H-. H-*) are called normal and antinormal ordered characteristic 
functions, respectively. They are Fourier transforms of P(|3, p ) and p(P, P ) 

4> N (P,P*)= I P P(p,p*)d 2 p (106) 

-j I _|A P p(P,P*) d 2 p. (107) 

Their interpretation as characteristic functions is obvious because normally and anti- 
normally ordered averages are computed from them in the usual way that averages are 
computed from characteristic functions. 

120 

We can also define the symmetrically ordered characteristic function 

4> S (P. (a*) = tr. {pe^ h} (108) 

whose Fourier transform 

W(P,P*) = J e 1 " K 4» S (P, \l*) ^ (109) 

120 * 

is the Wigner distribution. We shall also call W(P, P ) a quasi density. These quasi 
densities (102b), (103c), (109) and their corresponding characteristic functions (104), 
(105), and (108) are essential tools here. Note that the characteristic functions are in 
1:1 correspondence with their corresponding quasi densities, which are again in 
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1:1 correspondence with the system density operator p. In spite of the fact that the quasi 
densities are not generally interpretable as true probability densities with proper under- 
lying spaces, it is important to emphasize again that they can be used to compute various 
operator-ordered quantum averages. 

The characteristic functions are related by 


el/ ^ ^ 4> S (P. P*) = el ^ 4> A (M-, M- 

The quasi densities can be related through (110); for example, 
p(P, P*) = I e'l p_p, l P(p, P*) d 2 p 


(HO) 


(HI) 


and so forth. 

Before we turn to further specific development let us mention that different operator 
orderings and their corresponding quasi densities can be introduced for operators that 
are linear combinations of b and b^, for example, the usual conjugate variables q and 
p. We shall not pursue a detailed discussion here, since it should be clear how such 
a development can be carried out. 

3.1.4 Gaussian Quasi Variables 

jjc 

We now define a quantum system and its density operator to be Gaussian when P(P, p ) 
* 

is Gaussian in (P, p ), i. e. , 



with 

/ d 2 p P(p, p*) = 1. (113) 

The observable b in such a case will be referred to as a Gaussian observable. From 
Eq. (Ill) we have, for a Gaussian system. 



(114) 


with 

r ' = r + - 5 -. (115) 

o’ cr 
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By standard techniques of operator algebra the density operator is therefore of the 
form 


p = np (P, (3 ) = N exp ■ 


• exp < 




Vb 


2 

2(l-r' ) cr 


In 1 - 


. 2 . * 


r' 2 ) \ 

* 

cr <r 

*2 

cr 

\ + 1 


r 

h tJ 

► exp < 


J j 

2(1 


r'P P 
o o 


1 2 2 2 

2(l-r* ) cr (1-r 1 ) \(t cr cr/J 


( 116 ) 


where N is a normalization constant. This density operator cannot be brought into a 

* 

single exponential, except in the obvious special case when a = cr = 0. 

It should be clear from our definition that if b and b^ is a pair of Gaussian observ- 
ables, then any linear combination of them will also give rise to Gaussian quasi den- 
sities. 

An important property of our Gaussian system lies in the following theorem. 


Theorem 1 

When a photon system described by (b, b^) is Gaussian in the sense of (112), the third 

and higher order cumulants defined in any order are all vanishing. 

The validity of this result rests directly on the c-number commutation rule for 

[b,bt], To prove the theorem, we need only observe that from (112) and (97) the quasi 

densities (102b), (103c), (109) and the characteristic functions (104), (105), and (108) 

are all Gaussian in their respective variables. The higher cumulants^ or linked 
64 

moments of various operator orders are therefore all zero, as in the ordinary 
Gaussian case. Furthermore, it can be seen that we can define our Gaussian system 
with any one of the quasi densities or characteristic functions, since Gaussian properties 
of all others follow immediately in each case. Note that the explicit Gaussian form of 
p(P, p ) has been given in Eq. 114. 

% 

We wish to characterize our quantum system completely by the c-numbers (P, p ), 

which we call quasi variables in analogy with ordinary random variables. As we have 

* 

said, this is possible because given a Gaussian density of ((3,(3 ), we can write down the 
system density operator p. We have just seen that we can give simple descriptions of 
these quasi variables by discussing quasi densities like ordinary distributions. Thus 
the usefulness of our Gaussian definition stems from the fact that the resulting quasi 
densities and characteristic functions are easy to deal with as in the classical Gaussian 
case. Only a few parameters are required for their specification. 

It is important to observe that under our Gaussian definition the distribution P(P, p ) 
is positive for all (P, p ) and is also a smooth function. It can then be seen that many 
ordinary stochastic concepts can be introduced because all of the quasi densities possess 
mathematical properties like strict probability densities. 
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Let us now generalize the above development to the many-variable case. We consider 
a set of observables {b, , b£} in a total system with subsystems denoted by ^ . We 


assume that the observables of subsystem d are functions of b^ and b^ only and that 


each b^ is a photon annihilation operator 


b k’ b k 


(117) 


r d2 Pu 


(118) 


The commutators 

[ b k . b k .] =0 V k,k' 


and 


b k' 


k # k' 


(119) 


( 120 ) 


are further taken to be given c-numbers. 

We define the set {b^, b£} to be jointly Gaussian with corresponding quasi variables 


p^} when the P-distribution P(f3, p ) for the total system operator 


p = / P(P,P*)|P> <p| n d 2 p. 


( 121 ) 


is Gaussian in {p, p }. We have used the notation p to denote the set {P k } and 

|P> £ |p r ..p k ...> (122) 


represents the eigenstates of 


V" V-- >y • ? k ... |p,-..p k . 


(123) 


The P-distribution in (12 1) can be used to compute normal-order averages, that is', 
averages of operators where all the b^ stand to the right of all the b£. By the c-number 
character of Eqs. 119 and 120, various subsystem and operator-ordered characteristic 
functions and quasi densities are all Gaussian, so that various higher order cumulants 

vanish. Our situation here is completely analogous to the single subsystem case. 

❖ 

While we shall not write down the explicit form of P(P, p ), it is appropriate to note 
that it is completely specified by the mean 


s k = <4>' 5 k =<b k> 


(124) 


and the covariances 
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(( b l-\) < b k-y 

« b k'-V'V b k» = « b k- b k» b k'- b k'»> - <( b k- b k)( b k- b k'))* 


(125) 

(126) 


for an arbitrarily chosen ordering among the k's. Together with (119) and (120), we can 
obtain all of the other subsystem operator-ordered quasi densities and characteristic 
functions. To show how one can write down a characteristic function, we have the fol- 
lowing normal-ordered characterization function corresponding to P(f3, (3 ) of 


4» n (M:> !± ) 


tr. < 


n e^ 1 ^ n 


~^k b k 


(127) 


where 


e = Kl- 

it is intuitively clear that our joint quasi densities have the usual properties of 
jointly Gaussian distributions. Defining the marginal quasi densities for a subset of 
K- b l> in a manner analogous to the ordinary case as, for example, 

1 " d X' 


we can state the following theorem. 


Theorem 2 

If {b k . b£} are jointly Gaussian, then any subset of them is also jointly Gaussian. If 

{a^, a£} are obtained from linear transformations of {fc>^, }, then { a k> a £} are also jointly 
Gaussian. 

This theorem can be proved in exactly the same way as the classical results are 
proved, since the proof depends only on the form of the quasi-density functions. Further- 
more, any one of the subsystem operator-ordered quasi densities can be used, as they 
are all Gaussian. 


3.1.5 Statistically Independent Quasi Variables 

To continue our development of the properties of jointly Gaussian quasi densities, 
we first define the important notion of statistical independence of quasi variables. While 
it is possible to define conditional quasi densities, we choose directly our fundamental 
definition of complete statistical independence between subsystems to be the factor- 
ization of the total system density operator 

P = Pj <8> P 2 0 • • • 0 P k • • • (128) 
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into a direct product of subsystem density operators. 

With this definition we see that 

(«.( b r b l) • • • AMD ■ • ■> " (8i( b ,. b !)> • ■ • («k(*V b D) ■ 

where is an arbitrary observable of subsystem -d Also we have 

t b k > b k >] = 


( 129 ) 


b k’ b k' 


= 0, 


k 9= k' 


because (128) implies that the b^ are defined on different Hilbert spaces. 

We may state now the following theorem. 

Theorem 3 

X t 

The jointly Gaussian { b k f b k l are i nbe P enben t if and only if (119) and (120) and (125) 
and (126) are zero for k + k' . 

To prove this in the two-subsystem case, we note that with (129), (119), and (120) 
we can write 


<Pjp 2 l p IPjPj,) = <pJ <p 2 | pj ® p 2 I>i>Ip 2 > 

- (Pjl p l ^2 I p 2 I ^2 ^ 


so that the covariances are zero. The argument can be reversed to show the converse 
statement. Generalization to the multivariable situation is clearly straightforward. It 
can be seen from Theorems 1-3 that the Gaussian quasi variables {p k> P k } have many 
properties of ordinary Gaussian random variables. 

It is also fruitful to define normal- order statistical independence by the factoriza- 
tion of the P-distribution. For example, in the two- subsystem case 

K'Vr'^z) - p (l>rf>I) < 130 > 


In this situation the normal-ordered averages will factorize 

<Q< n >( b l, b t;b 2 . b|)) = b()) (Q™(b 2 , b I)) . 


(13 J.) 


but the antinormal ones may not. The difference between (128) and (130) is that in the 
latter case the commutator 


b l 


k * k' 


may not vanish. It is again obvious that jointly Gaussian {b^, b^} are normal-order 
independent if and only if (125) and (126) are zero for k + k' . Henceforth, the word 
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'independent' will imply complete independence, whereas normal-order independence 
will be given in full terms. 


3.1.6 Sums of Independent Quasi Variables 


We now consider two statistically independent subsystems d and d „ with a total 

1 C* J- 

density operator p. Each subsystem contains observables that are functions of {b, , b* }, 

iv K 


V b I 


which are merely c-numbers 


k = 1,2, which we allow to possess commutator 
that need not be unity. Equation 119, of course, holds in this case. We introduce the 
operator 


b = b 1 + b 2 (132) 

and consider the quantum system whose observables are functions of b and b^. We call 
this system the 'sum system.' The quantum- state space of this sum system is con- 
structed in the following way. Let 


Ip) = |Pj) ® |p 2 > 


(133) 


be the eigenstates of b. Here | Pj) and | P 2 ) are eigenstates of b^ and b^, but they are 
not necessarily complete in the systems ^ ^ and 4 The eigenvalues p of b are 

P=P 1 +P 2 . - (134) 

where Pj and P 2 are eigenvalues of b^ and b 2 - There is in general an infinite set of 
eigenvectors | p) with the same eigenvalues p. For our purpose, all of the eigenvectors 
I Pj) <8> | P 2 ) associated with a given | p) are equivalent, so that we can pick any one of 
them. The state space of our sum system is then the space spanned by the chosen | p). 
Clearly, the sum system is not the total system ^ + d 

4. A £ 

Let us assume that the b of (13 2) obeys [b, b'] = 1. Furthermore, the eigenstates 
of b chosen in the manner above is clearly complete in the sum system. It is then 
meaningful to state the following theorem. 


Theorem 4 

The density operator p g of the sum system can be represented by a P- distribution 
which is the Fourier transform of 



(135) 


To prove this statement, we observe that by statistical independence 


tr. p e 


n(bj+b|) -p r (b 1+ b 2 )_ 


= tr. \ p 1 e 


t * 

pb' -p b 

e J tr. \ p 2 e 


Pbi, -p b 2 


(136) 
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Furthermore, 


tr. p e 


K b H) -^w 


= tr. p 


, f * 
e^ b e^ b 


TSTb 


(M-i H*)- 


Since the state |p) is complete in our sum system, the Fourier transform of (136) is 
the P- representation of the density operators p g . A more complete discussion is given 
in Appendix G. 

Let us define 



(137) 


(138) 


(139) 


We have the usual convolution formula 

PJ,(P.P*) = / P'jfP-p'.P*-?'*) Pi,(P’.P'*) d 2 p. (140) 

119 * * 

This formula has been derived heuristically when P'j(P, P ) and P1>(P, P ) are both the 

P- representations of p and p , the subsystem density operators of (128). Our devel- 

1 * 

opment shows that P! ((3 , p ) cannot then be interpreted without proper scaling as the 

D 4. 4- 

P- function of p g because in such a case [b, b 1 ] = 2. Thus when the commutator [b, b' ], 
and also 


b i' b ! 


b 2’ b 2 


(HI) 


are arbitrary c-numbers, our formula (135), (136), and (140) still retains its valid- 
ity, although none of P^p, p*), P'^P, P*), and P^(p,p") may be interpreted as a 
P- representation. 

It is easy to see from the c-number character of the commutator involved that other 
ordered characteristic functions also factorize. This is also evident from (128), as a 
consequence of statistical independence. Therefore the other quasi densities for the 
sum system can be obtained by convolving the corresponding Fourier transforms of the 
subsystem characteristic functions. Formulas like (140) are then not special to the 
P- representation, but hold for other quasi densities. Since each of the quasi densities 
represents the sum system uniquely, we can use any one for convenience. 
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In the case when Eq. 97 holds, it can be seen from the same argument that the com- 
mutators (141) are not important for the representation of p g as long as their sum is 
unity. In general we have the following theorem. 

Theorem 5 

Consider a sum of N independent subsystems with 

b = b l + b 2 + + b N (142) 


and 

[b, b 1 "] = 1 . 

The P-distribution of p is then obtained by convolving successively 

s 


* * 


P^P.P*) = y e pK ^ P 




,2 

* d p 


2 ’ 


(143) 


where for given 


* a / ^ b f 

♦ub * ) = ( e e 

i 


(144) 


the resulting P^fP, P ) is independent of the distribution of the c-number commutators 


b.,bt 

l i 


= c r 


(145) 


Thus it is easy to consider a classical subsystem with 




0 added to other quan- 


tum subsystems. It can now be seen that factorization of characteristic functions is 
the more fundamental formula for summing statistically independent quasi variables. 

In the case of normal-order statistical independence our development is valid when 
restricted to normal-order averages and characteristic functions. In this case the con- 
volution of the form (140) still holds, but now the other quasi densities cannot be con- 
volved in the same manner. Normalization of the commutator is again required. 


3.2 Quantum Stochastic Processes 

We now turn our attention to stochastic processes. As we have mentioned, by a 
quantum stochastic process we mean a time -dependent observable in the H-picture. 


3.2. 1 Fundamental Characterization 

Since our quantum process is a dynamical variable, it will generally be character- 
ized by the statistical dynamical equation it obeys, together with the initial system 
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density operator. Since such information is not generally available, we may want to 
seek other convenient characterizations. 

Let us consider the photon operator b{t) with 


[b(t), b t (t)] = 1 (146) 

and the quantum system composed of observables that are functions of {b(t), b^(t)}. In 
such a case the equation of motion for b(t) or the equation of motion for p(t), the system 
density operator, gives the complete specification of the system behavior. It is more 
convenient to use b(t) for a general characterization because we can then readily obtain 
other density operators. 

A general abstract characterization of b(t) can be given in terms of the multi-time 
quasi-density functions. 


p (Vn‘n ; • • • “toi) -■ ( 6 (f>r bt(l i>) • • • 6 «- bt <*„>) 6 «V b «n» ■ ■ ■ 6 »r b «i») ■ 


(147) 


where the average is with respect to an initial system density operator p(o) and 


6(p*-b t ) 6(p-b) 


is defined by 
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M(b, bh = n{M (n) (P, (3*)} 

= / M (n) (p, P*) 6(p' J "-b t ) 6(p-b) d 2 p. 


(148) 


Other sequences of such delta operators are defined similarly. For a complete specifi- 
cation of b(t), we need to know (147) for any time sequence {h; i= 1, . . . n}, and to know 
the commutation rules [b(t), b(t' )], 

[b(t),bV)] t = t’ (149) 

which are taken to be c-numbers. We shall assume that 

[b(t),b(t')] = 0. (150) 

The multi-time P-distribution (147) can be written as the Fourier transform of a 

70 71 

multi-time characteristic function, in case we do not like delta operators. ’ Further- 
more, it can be interpreted as the P-representation of a density operator describing 
measurement output probabilities of observables which are functions of 


bftj), b(t n ). 


The specification of a general quantum process is at least as complicated as the 
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specification of a classical random process. We shall examine certain special cases 
for which a complete specification can be given in a relatively simple manner. 

3.2.2 Gaussian Quasi Processes 

We consider the pair of photon operators {b(t), b^(t)} obeying (146). Their associated 
classical functions {(3(t), |3 (t)} will be called a quasi process. We define a quasi process 
to be Gaussian when its multi-time P-function (147) is a Gaussian in the variable {p(t.), 

*i.i 1 

(3 (t.)| i= 1, . . . , n} for every n. From the c-number property of (149) and (150) different 
quasi densities with any time order are Gaussian. It also follows that any operator and 
time-ordered quasi densities and characteristic functions are Gaussian. The situation 
here is similar to the many Gaussian quasi- variables case. We have therefore the fol- 
lowing theorem. 

Theorem 6 

When {b(t), b^(t)} is a Gaussian quantum process if and only if the third and higher 
order cumulants defined in any time operator order. are all vanishing. 

Note that for the quasi process {(3(t), (3 (t)} to be Gaussian only one quasi density or 
characteristic function need be Gaussian for each n sequence {(3 (t ^ ) p (tj); ; p(t ) 

With (149) and (150) a complete characterization of a Gaussian quasi process is 
given by the mean 

b t (t) = <b r (t)) = <b(t)>* (151) 

and the covariances 

<(b t (t)-bF)*)(b(s)-b(^j)) = C + (t, s) (152) 

b'b 

<(b(t)-l^))(b(s)-b(i))> = C bb (t, s) (153) 

from Theorem 6. We shall not write an explicit form of the multi-time P-function, as 
it is the same as a classical distribution. 

We can see that all usual properties of a Gaussian process are preserved in every 
one of our quasi-density functions. Jointly Gaussian processes can be similarly defined. 
The notion of statistical independent processes also follows in the same manner. We 
shall not pursue a detailed development here. 

3.2.3 Karhunen-Lofeve Expansion for Quantum Processes 

We shall now develop a Karhunen-Lofeve expansion theorem for a class of quantum 
processes. Let b(t) be an observable with mean (151) and covariance (152) and (153). 

No Gaussian assumption is made and (146) is not needed. Let 
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( 154 ) 


C + (t,s) = 2 A,*( t ) * ( S ) 
b T b k 1 k k 


C + (t, s) = {(b(t)-b(t))(b^ (s)-b(s)*)) = 2 X.^4 (t) <j>*(s) 

Wi' I, ^ K K 


Sbb^’ s ) = 0 


<b(t)> = 2 a k 4> k (t), 
k 


f \(t) 4» k ,(t) Wj(t) dt = 6j 


2 <t> k (t) <t> k (t') Wj(t')= 6(t-t' ) 
k 

similar to Eqs. 7 5 and 76. We have then the following theorem. 
Theorem 7 

For a process b(t) obeying (154)-(156) we can expand 

b(t) = 2 c k 4> k (t) 
k 

with a set of operators a k having mean 

<c k >= \ =(a i>* 

and covariances 


{(4"“k) (o k'" S k' , ) ,! 6 kk'4 
/(a k ,-v)(4-4)) = 6 kk' x 2 


^ a k' - “k , ^ a k”“k^ = °* 


To show the possibility of such an expansion, we observe that 

« k = f 4 (t) W l (t) dt 


((4~ S k) (a k“ S k' , ) = 1 C b tb (t ’ S) 4>k<t) Wj(t) ^' (S) W](S) ^ 


= X Kk- 
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Equation ( 163) follows similarly. When b(t) is Gaussian the quasi variables corresponding 
to the operators will then be statistically independent. 

We next define a white Gaussian process to be one for which 


(b^(t)b(u)) = Vj6(t-u) (166) 

(b(u)b^(t)) = v 2 6(t-u) (167) 

<bt(t)bt(u)> = v 3 6(t-u) (168) 

<b(t)b(u)> = \ 3 6(t-u). (169) 

We also have an expansion theorem similar to Theorem 7. 

Theorem 8 

A white Gaussian quantum process b(t) with zero mean and correlation (166)- (169) 
can be expanded in any real orthonormal set 


b(t) = 2 « k <fr k (t) 
k 


(170) 


f \(t) \,(t) dt = 6^, 


(HI) 


such that the associated classical variables of the operators a are statistically inde- 
pendent Gaussian quasi variables 


^ a kV ^ = 6 kk ,V l 


W' a k^ 5 kk ,y Z 


< a k a l )= 6 kk’ Y 3 = <a k a k'^ 


(172) 


(173) 

(174) 


The proof of this theorem can be carried out in a straightforward manner similar to 
the previous one, and is therefore omitted. 

Expansion theorems of this type are clearly useful for many purposes. In classical 
communication theory their applications are well known, and we would expect that they 
would also play an important role in quantum communication analysis. 


3.2.4 Markov Quasi Processes 

Another useful characterization of quantum processes is possible by using the 
70 

Markov idea. We let 
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6 «- bt(t n») 6 ®n- b<t „») 

' I p (CPn’*„l«* “-‘n-l) d2 “ ( 6 ("*- bt Vl>) 6 <“- b( *n-l»’)' tbt 


as the definition for the conditional distribution 


P(P , |3, 1 1 a , a, t') 


t > t'. 


A quantum process is then defined to be Markovian if its multi-time P- function of (147) 
obeys 7 ® 


t > t ,>...> t, . (178) 

n n-1 1 ' 

6> 5 TO T 3 

This property can be shown to be equivalent to the quantum regression theorem ’ 

with which general multi-time quantum averages can be computed by two-time results. 
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Other characteristic properties of what we call a Markov case ’ also follow from 
(177). If the P-function of (147) is a classical density, (197) is the usual definition of 
a Markov process and gives rise to the Chapman-Kolmogorov-Smoluchowshi con- 
dition. 

It can be seen again from the property of c-number commutators involved that the 
Markov property ot one quasi density implies that all other quasi densities are 
Markovian, obeying relations similar to (177) and (178). 

As in the ordinary Markovian case the one-time P-function of a quantum Markov 

j*c 

process P(P, p , t) can be shown to obey a Fokker-Planck-Kolmogorov equation under 
appropriate conditions, as the derivation involves only the condition (177). In the same 
way the conditional Green's function solution to the Fokker-Planck-Kolmogorov equa- 
tion^ 0- ^ 4 is the conditional distribution (176). Thus a one-time density operator con- 
ditioned upon an initial distribution contains full information about a quantum Markov 
process. 

We call the associated classical function P(t) of a quantum Markov process b(t) a 
Markov quasi process. Thus Markov quasi processes have quasi densities that are 
completely characterized by a Fokker-Planck-Kolmogorov equation under conditions 
that will be obeyed in our applications. Specification of such a quasi process can then 
be given in terms of the drifts and diffusion coefficients^ 0 

A„(t) = lim r~r < b (t+ At ) - b (t ) ) (179) 

P At— -0 
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(180) 


2! D (t) = 2! D + (t) 

p p b'b 


= lim ~rr <[b^(t+At)-b^(t)][b(t+At)-b(t)]>. 
At— 0 


\ 

6 ) 5 1 1 

Generalization to include higher order diffusion terms is also possible. ’ 

In our application we shall adopt a Langevin rather than a Fokker- Planck point of 

view. In such a description our quantum process is defined by a quantum Langevin 

, . 69 

equation 


Lb(t) = e(t) + f(t) 

t t * t 
L'b' (t) = e (t) + f' (t), 


(181) 


where L is a linear time differential operator, and e(t) a deterministic excitation. The 
noise process f(t) is an operator 

[f(t).ft(t)] * 0 

and is usually taken to be Gaussian. .When 


[f(t),fV)] a s(t-t') 

the observables {b(t), b^(t)} will be components of a vector quantum Markov process. 
When L involves only first-order time derivatives, {b(t), b^(t)} then becomes a quantun 
Markov process. 

With proper ordering interpretations we can find, as previously, the c-number 

quasi processes {p(t), (3 (t )} which correspond to {b(t), b^(t)} and which obey a c-number 

Langevin equation. Usual relations between Langevin equations and the corresponding 
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Markov processes will still hold for this quasi process. When the equation is 

linear the situation is particularly simple. We shall not give a detailed development 
here. 


3.2.5 Stationary Quantum Processes 

Stationary processes play a particular role in our study similar to Markov pro- 
cesses, since they obey some simple fluctuation-dissipation theorems. We define a quan 
turn process b(t) to be stationary when all multi-time averages are invariant to a shift 
in origin. That is, 

(b^tj). • .b T (t n )b(t n ) . . .b(y> = (b^tj-y . . .b T (t n -t o )b(t n -y . . .b(t r t o )>. 

(182) 

In our application we actually need only 
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etc. 


(bUt)b(V)) = <bt(t-t')b{o)> 

for second-order averages, since we deal, for the most part, with Gaussian processes. 
The stationary processes that we shall encounter will be described by linear time- 
invariant differential equations. 

3.3 Conclusion 

We have developed certain fundamental quantum stochastic concepts that will be 
employed in the following sections for quantum channel characterization. While we shall 
actually be talking about statistical quantum fields, rather than processes, there is no 
need to discuss them separately. Quantum fields bear to quantum processes relations 
exactly analogous to those of classical random fields to classical random functions. 

The most important idea in our previous development is Gaussianity, since we shall 
deal in the quantum treatment exclusively with Gaussian additive noise. Our results 
therefore lay the groundwork for treatment of such quantum processes. The notion of 
a quasi process will make classical and quantum-channel comparisons more efficient 
and illuminating. 
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D. QUANTUM FIELD PROPAGATION AND 
CLASSICAL CORRESPONDENCE 


We shall develop the theory of quantum field transmission through a linear system, 
employing quantum processes that have just been described. Our quantum discussion 
will closely parallel the development of classical random field propagation in Sec- 
tion I-B. In particular, we shall establish the way in which given classical field specifi- 
cations give rise to corresponding unique quantum field specifications. With this 
connection we can then set up the quantum channel representation directly from the given 
classical channel. 

We shall restrict our consideration to deterministic channels. Stochastic channels 
and signals will be discussed in Section I-E. 


4. 1 General Theory of Quantum Field Propagation 

We consider the transmission of electromagnetic quantum fields through media char- 
acterized by linear partial differential equations of field propagation. The nature 
of our channel is exactly the same as that previously discussed. Instead of c -number 
wave fields t) we are now just treating q-number fields 4> (r, t). Our discussion 

is a generalization of section 2. 1 to the quantum region, by application of our results 
in Section I-C. 

- 74-77 t — 

Let ih 0 p ( r , t ) be a scalar field operator with adjoint i|F p (r,t) and from which the 

electric and magnetic fields can be obtained by linear operations. See Appendix F for 

details. The dynamical field equation describing our channel is 


££ + op (r,t) = E(r , t) + J^ op (r,t), 


(183) 


where & is a linear partial differential operator with' respect to t, the components 
of r, E(r, t) a c-number deterministic excitation, and ^” op (r, t) a random -noise -source 
operator. As in Eq. 5 we take 

(184) 

for two ordinary differential operators with respect to t and r. The noise -source oper- 
ator has zero average 

<^op (? - t} > = ° (185) 


and is taken to be a Gaussian quantum field. Although we only discussed quantum pro- 
cesses in the last section, it should be clear that 4» Q p{r,t) is a Gaussian quantum 
field if and only if all of its linear functionals 

/ «|» (F, t) W(?,t) dFdt (186) 
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are Gaussian quantum observables. 

Since -£f factorizes in Eq. 184 as in Eq. 5, we have (Eqs. 6 and 7) also 
= a k +v n ) \n (? ’ t) 


V (r<t) = V F) y n (t) 


(187) 

(188) 


for the c -number eigenfunctions 4> kn ( r >t) of if . 

In the present quantum case we have to know the commutator 


L V r ’ t) ’ 4 'op (r '' t,) . 


(189) 


or equivalently 




(190) 


for a complete characterization of the quantum field. We shall discuss later how 
Eqs. 189 and 190 may be determined in different situations. Here we generally assume 
that the noise field (r, t) and, in particular, the commutator (190) are . specified. 

It should be clear that our present consideration is again restricted to Gaussian noise 
added to the electromagnetic fields, and to linear space -time filter systems. 

4. 1. 1 General Case 

Let us assume that (190) is given and that 


_ D (r',t')> = Z C k + (t,f) <£(r) i(r') 

°P °P k K K 


op 


(191) 


<jr nn (?,t)jr nn (?., t ')) = 2 C^(t,t') <t> k (r) ^(r«) 


op 


op 


( 192 ) 




This •^" 0 p( r > t ) i s taken to be the noise source driving the wave equation for the field 


l ♦d ? 1 V * 1 


(193) 


with photon operators b^(t) 


b k (t), bj(t) 


= 1 . 


(194) 
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We assume (see Section I-B) that 


/ <f> k (r) 4> k .(r) dr = 6 kk- 


( 195 ) 


k 


(196) 


A knowledge of (190) and (191) is equivalent to that of (191) and ( 2F t)^"£ (r \ t')) 
when (190) is a c-number function that we shall assume. We take 


<^op (r ’ t) ^L< r, * t, )>= Z C \ **) 4> k (r) <t>*(r'). 


op 


(197) 


k jr' 


Thus we are assuming that (190) is in general of the form 



(F.t),jrtp( ? . 


t') 


r{c k + (t,t')-C k + (t, t')} <|>,(?) 4>*(r'). 

k jrjr T jrV k 


( 198 ) 


If we expand 

' Jr, t) = 2 4 k (F) F (t) (199) 

Op k K K 


for noise operators {F k (t)}, we see from (192), (197), and (198) that 


F (t),Fj,(t') ={c k ,(t,t')-C k . (t,t’)}6 , 

L k J jrjr’ 

(200) 

[F k (t),F k ,(t-)] = 0 

(201) 

and 


<iyt)> = 0 

(202) 

<pJ(t)F k ,(t')>= 6 kk ,C^_ t ^(t,t') 

(203) 

<P k «)F k ,(t')> = 6 kk ,C^(t,t'l. 

(204) 

Thus as a direct consequence of (191), (192), and (197) we have 
observables F k (t), If we also write, as in Eq. 15, 

independent Gaussian 

E(r, t) = T ^(r) e k (t), 
k 

(205) 


Eq. 183 is reduced to the following set of ordinary differential equations for the oper- 
ators t> k (t) 
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(206) 


(X. k +i? j) b fc (t) = e k (t) + F k (t), 

where the correlations of the Gaussian operators F, (t) are as given by (200) -(204). 

K 

Similarly to the discussion of Eqs. 22 and 23, we can write the solution of (206) in 
the form 

b k (t) = -C h k (t ’ T) e k^ T) dT + n k P * tJ (20?) 

o 

with an additive noise operator 

n£ P (t) = h k (t, t) F r (t) dT (208) 



h k (t, t) F fc (r) dT 



h k (t, t) F k (T) dT. 


(209) 


The h k (t, t) is again the zero-state impulse response of the differential system (Eq. 16). 
The second term on the right-hand side of (209) represents the contribution to the addi- 
tive noise n° p (t) of initial conditions. It can be seen that 

[b k (t),b k ,(t')] = 0 (210) 


b k (t)- bT, (f) 


= 6 


kk' 


J‘-oo dT £« ds h (t,T)h k (t',s){c k 




.(t, s) - C k . (t, s)} 

' ^ rT ^ r 


( 211 ) 


<b k (t),bj,(t')> 


6, . , f 1 dT ft 

kk' -oo 


-oo ds h k (t ’ t) h k (t ’’ 


s) CT 


.(t, s) etc. 


( 212 ) 


3F 


Thus the different normal-mode operators b k (t ) are statistically independent Gaussian 
observables, since they are linear transformations of the independent F, (t). 

K 

With the representation (207) it can be seen that for a fixed -input excitation each 
Gaussian t> k (t ) is completely specified by the correlations of F k (t) and h k (t, t). Our field 
(r, t) is therefore also completely specified with knowledge of 4> (r). 

Op K 

We also emphasize that the noise source ^" o p(r, t) or F k (t) in this differential equa- 
tion description is a thermal noise associated with the filter system. Other possible 
independent noises can also be introduced. They will be considered in Section I-E. 


4. 1. 2 Markov Case 


We next investigate the case in which 


C k (t, t ') = 6(t-t') 2J k (t) 

1 


(213) 
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( 214 ) 


C k + (t,t’) = 6(t-f) 2J k (t) 

.rV 


& 




6(t-t') 2J^(t) 


(215) 


for the noise source (r,t) driving the field (193). In this situation the F, (t) are 

Op K 

independent white Gaussian observables so that each b^(t) is a component of a vector 
Markov process, as discussed in section 3. 2.4 (Part I). This vector quantum process 
is formed by b^(t) and its higher derivatives. 

Following the same discussions as in section 2. 1. 2 (Part I) we consider here the 
case in which involves only first-order time derivatives. The vector Markov case 
is treated in Appendix B. Thus we have 

(X. k +J?j) b k (t) = e k (t) + F k (t) (216) 


<F k (t)> = 0 

<F k (t)FJt')> = 6 kk ,Mt-t') 2J k (t) 


< F k (t)Fj,(t')) = 6 kk - 6 <t-t') 2J k (t) 


F k (t),F k ,(t,) 


= 6 kk . 6 (t-t') 2{j k (t)-J k (t)}. 


(217) 

(218) 
(219) 


Again we find it convenient to introduce the notation 



( 220 ) 


( 221 ) 


( 222 ) 


(223) 


( 224 ) 
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and write 


( ~k + :£V ^k (t) = ^k (t) + -k (tK 


(225) 


A development analogous to section 2. 1. 2 (Part I) can be given for the quantum oper- 
ators. (We shall just give the most important points.) We assume a stable system in 
the sense of Eq. 55. Let 


ik + -i ft* £k (t » 


h k (t, t) = 0, 


i ft + *k (t) 


h k ( t. t) = I, 
then we have 

b k (t) = h k (t, t) e k (T) dr + n° p (t). 


(226) 

(227) 

(228) 


(229) 


n° P (t) = h k (t, t) F k (r) dr. 
The noise -source correlation is 


(230) 


<F k (t)Fj(t')> = 2D k (t) 6(t-f), 


(231) 


Pic'*' - 


jk(t) 

jf(t) 

J 2 ( t) 

J^*(t) 


The covariance 


^f(t,t>) = <n° P (t)n° P (t-) T > 
is related to the variance 4> k *(t> t) by 

^(t.f) = h k (t,f) ^(t'.f) t >f 

^(t'.t) = h^t.P) t>f. 


(232) 


(233) 


(234) 
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Also, we have 


= 2 /* M h k (t, T) D^(t) hj(t, T) dT 


(235) 


^ = 2D^(t) - A^t) <g l (t,t) - <j£ l (t.t) A*(t). 


,qt/ 


- A&t 


(236) 


Derivations of (234)-(236) follow in the same way as those in section 2. 1. 2 (Part I). Sim- 
ilarly to the classical case, the relations (234) and (236) will be our quantum -mechanical 
fluctuation dissipation theorems for quantum Markov processes. The discussion in sec- 
tion 2. 1. 2 (Part I) applies here in an identical manner. Further discussion is given in 
Appendix C. 

We wish to give here the fluctuation -dissipation theorem for fields in coordinate- 
independent form. Let G(rt;r't') be the Green's function of 


(if +JS? ) 4, (r, t) = E(r, t) + ^ (r, t) 

~ I C — Op OP 


op 


(237) 


under the space boundary conditions of interest and zero initial conditions. The nota- 
tion 


2 <)>, 


: (r) b k (t) \ 




2 <j>*(r) bj(t ) J 


(238) 


and E(r,t), 2F (r, t) is obvious. This Green's function can be expanded in the form 
— — op 




G(rt;r't') - 2 ^(r) 4> k (r') h k <t;t')'. 

k ~ . 


(239) 


It is now straightforward to obtain from (234) and (239) the following distributive 
fluctuation-dissipation theorem for the output field 4[ op (r, t) 

Theorem 9 

The two-time field covariances are related to the one-time field variances by 


( 4Pp(r, t)4f p(r', t’)> = / dr" G(rt; r ''t')<^ p (r'', t')^(r', t')); t > t* (240) 


-op' 


<^ p (r,t')^ p (F\t)> = f dr"<^ p (r,t')^ p (r",t)) G T (r"t;r't'); t > P. (241) 


Here the notation (F, t) implies that the mean has been subtracted out. To establish 
(240) we can multiply both sides of (234) by 4> k (r) <j> k (r') and then sum over k. The right- 
hand side of (240) follows from simple manipulations. The fluctuation -dissipation 
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theorem for the noise -field correlation can be obtained from (236) similarly. 

4. 1. 3 Stationary Case 

In the stationary case it is more convenient to consider directly the electric or mag- 
netic fields. Let 

* (r.t) = 2 <|> k (F) Q k (t) (242) 

k 

be the electric operator obeying the wave equation 

1 +Jgf’ 2 ) g (r, t) = E(r, t) + ^ (F, t). (243) 

Note that Q k (t) is a self-adjoint operator so that E(r,t) is now real. The space-time 
invariant operators if ^ and if^ involve then only real coefficients too. The correla- 
tions of the self-adjoint noise field (r,t) is given by 

<^op (F ’ t) ^op (? '- t,) > = * c i^') <hc (?) 4 ’k (? ' ) (244) 

f ir op (?lt) ’ ir op (?, ’ t,) l = 2 c 2 (t_t,) ♦k (?) *k (?,) - (245) 

k 

A decomposition into spatial normal modes, similarly to previous cases, yields 

(Xk+if^ Q k (t) = e k (t) + F k (t) (246) 

for real excitations e k (t). The noise source F k (t) are self-adjoint operators and have 
correlations, from (239) and (240), given by 


< F k (t)F k- (0) > = c l (t) 6 kk' 

(247) 

< F k (0)F k ,(t)> = {C*(t)-C*(t)} 6 kk , 

(248) 

The higher cumulants of F k (t) are taken to vanish so that F k (t) are 
observables. 

The observables for each mode k are functions of Q k (t) and 

independent Gaussian 

A dQ.(t) 

P k (t) = dt • 

(249) 


The observables Q k (t) and P k (t) would have quasi densities, for example, the Wigner 
distribution, that are Gaussian. See Appendix F for further details. It should be 
clear that properties of Q k (t) and P k (t) are completely specified by those of F^(t) and 
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the impulse response h^(t, t) of (240). 

The commutator between the conjugate observables 

[Q <t),P (t')] 


( 250 ) 


has now to be given or determined in place of (231). Instead of (194) we have 

[ Q k (t) . P k (t)] = ifi. 

where R is Planck's constant. 

We define the Fourier transform of an operator A(t) by 

AM = e itot A(t) dt 


A f (co) = e iayt A f (t) 


dt 


(251) 


(252) 

(253) 


and the spectrum by 

(AMA^M) = C dt e~ ia)t <A(0)A t (t)> 


(254) 


(A^MAM) = dt e~ la)t <A t (t)A(0)). 


(255) 


When A is self-adjoint the dagger notation in the spectrum just denotes where the time 
dependence belongs in the correlation function. The following quantum fluctuation- 
dissipation theorems hold^’ 1° 7 as generalizations of Eqs. 68 and 69. 


<Fj(w)F k (u)) = 2finM ^f k («) 

<F MfJm) = 2fi{nM+l}i?*M 


<Q^(«)Q^(w)> = 2R n(co) Im «j 


•^ R M 


<Q' (cj)Q'^(w)) = 2fi{n(io)+l )} Im 




(256) 

(257) 

(258) 

(259) 


where 


1 


n(oj) 


Rw/kgT 

e - 1 


(260) 
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The parameter T is the temperature at which the fields are in thermal equilibrium. The 
frequency response ££ k (co) and -5f^.(co) is defined in Eqs. 65 and 68. The notation Q^(cj) 
indicates, as usual in our treatment, that the mean has been subtracted out. 

Q k M = Q k (u) - <Q k M). (261) 

For a discussion of Eqs. 256-259 see Appendix C. Other correlations of P k (cj) can be 
obtained similarly by noting that 

P k (co) = -iwQ k (w). (262) 

It is clear that (250) and (252) goes readily to the classical limits (Eqs. 68 and 69) when 

kgT » fiu>. 

To give the fluctuation -dissipation theorems for the field ^(r, t), we let 

<«?^(r, co)«f(r, «)) = Z <M?) <t> k (?')<QjMQ k M> (263) 

k 

<*'(r,u)* '*(?',«)) = Z <j> k (F) <t> k (?')<Q^(co)Q^{co)). (264) 

k 


The expressions £ (r, gj) are clearly time Fourier transforms of £ op (r, t), and the cor- 
responding spectrum is similarly defined as in (254)-(255). We also define the Fourier 
transform of the Green's function 

G(ru;r'o) = e* wt G(rt;r'o) dt. (265) 

We have then the distributive fluctuation -dissipation theorems for £ (r,t) in the fre- 

quency domain expressed by the following theorem. 

Theorem 10 

The spectra of £ Q ^(r,t) are given in terms of G(rw;r'o) by 

< £' ^(r, w) £ '(r ', g>)) = 2Rn(oj) Im {-G(rto; r 'o)} (266) 

< £ '(r, g>) £ '^(r ', o>)) = 2R{n(w) + l} Im {-G(rto; r 'o)}. (267) 


To establish (266), we multiply both sides of (258) by 4> k (r) ^(r 1 ) anc ^ sum over k. The 

right-hand side of (266) follows from noting that {.^(io)} - 1 is the Fourier transform of 
h k (t). Similar relations can be given for the correlations of F(r, to) which will not be 
discussed here. 
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Note that in this stationary case we can also consider the photon operators 
{fc> k (t), b^(t)} instead of {Q k (t), Q^(t)}. They are related in a very simple way. 



(268) 


(269) 


where" co(k) is the dispersion relation between frequency gj and wave vector k. Thus 
the results on {Q k (t), P k (t)} can be transferred to the variables {b^t), b^(t)}. We have 
used the present form here mainly for convenience. For further discussion see Appen- 
dix F. 

4. 2 Necessity of Introducing Quantum Noise Source and 
Preservation of Commutation Rules 


It is entirely possible and consistent to have 
(r, t) = 0 

in the classical wave equation (Eq. 1). Such a situation would occur, for example, when 
T = 0 in the stationary case, as is evident from Eq. 56. In contrast, we cannot set 

' (r, t) = 0 

^ op 

in the corresponding quantum case even when the temperature is zero, since the spec- 
trum 

<F k (co)F £(«)> 

of (257) is nonvanishing. This so-called zero-point fluctuation 7 ^’ arises from the 
commutator (251) and is a distinguishing quantum effect having no classical analogs. 

These zero -point fluctuations are always present physically. As we have just 
observed, they are intimately connected with the commutation rules for the field vari- 
ables. To insure the proper appearance of such quantum fluctuations, we have to insist 
on the preservation of commutators like (194). Mathematically, the validity of b k (t) or 
Q (t) as proper quantum observables also depends on such commutator conservation. 

K 

Physically the presence of such quantum fluctuations can be traced to quantum- 

mechanical energy conservation. We shall not elaborate on this point here. 

6) 9 *7 3 130 

We shall show that preservation of field commutator rules ’ ’ requires, in 

general, the introduction of operator noise sources in the wave equation (183). We shall 
consider, in contrast to previous cases, the general conservation of two-time commu- 
tators like (211) or (189). Some explicit formulas will be given for the noise-source 
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commutator that conserves the field commutator through the wave equation. Again we 
find it natural to divide our discussions into three cases. 


4. 2. 1 General Case 

In our general case we wish to determine the commutator (189) 


4* (r,t),4^ (r\t' 

mp mp 


= 2 4> k (r) 4 k V) C k (t.t') 

k 


(270) 


so that 


b k (t),bj,(t') 


6 k k ' C k (t * t ' ) - 


(271) 


When the noise field (r,t) is taken to vanish, the solution b, (t) of (206) can no longer 

Op K 

be expressed in the form (207)-(209) because the initial conditions have to be written 
explicitly. Similarly to Eq. 29, we can write 


n n r-1 d r_1 

bjt) = 2 2 (~l) r h k (t, t) 

r=l n'=l dr 


T— t 


i (t ) bP r (t ) + C h. (t, t) e, (t) di 
n-n' ok o J. k k 


(272) 


for the form of \ k + ^ given by Eq. 25. The commutator can therefore be evaluated 

as 


b k (t), bj,(t ') 


n n n n 
6 2 2 2 2 
r= 1 n '= 1 r'=l n"=l 




r-l 


dT 


— ^ h. (t, t) 
r-l k 


T=t 


o 


,r '-1 


dx 


r'-l k 


h, (t',T) 


.(t ) a „,,(t ) 
n-n o n-n o 


T— t _ 


,n'-r,. , , tn"-r / . , 
b, (t ), b, (t ) 
k o k o 


(273) 


from the initial commutators. We have assumed in (27 3) that the {b k (t Q ), b^(t )} commute 
between different k's. We shall also assume (210) in general. 

For a given system described by (183) and a given two-time commutator (271) it is 
clear that both sides of (273) are specified which will not be equal in general. We need 
always to have 


C k (t,t) - 1. 


(274) 


By setting t = t' in (27 3), the commutator (194) is again not preserved in general for a 
given (206). To see this in a simple example, consider the equation 
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db(t) 
~ dt~ 


-icob - b; 


V > 0 


with the solution 

b(t) = exp(-iut--2 t) b(0). 

The corresponding commutator 

\ 

[b(t), b*(t)] = e Yt 

is therefore decaying, and violating the conservation of (194). 

In general, the noise-source commutator (200) is determined by the condition (194), 
which amounts to solving the integral equation c 


1 = / dt'dt" h k (t, t') h k (t, t") 


F k (t')> F k(t") 


(27 5a) 


The two-time commutator (271) is then given by 


bJt), b,!(t 


= / dt"dt'" h k (t, t») h k (t',t'") F k (t»), F^(t"') 


(27 5b) 


One way to solve (27 5a) is to apply the eigenfunction expansion for h k (t, t) 

h k (t ’ T, = 2 

n n k 


Alternatively, we may convert (27 5a) to a differential equation for 


F k (t) ’ F k (r) 


(27 5c) 


In 


both cases the questions of existence and uniqueness of solutions, as well as methods 

87-89 

for finding them explicitly, can be studied by conventional methods. ” Note that the 
solution thus found depends on k in general, and hence the introduction of spatially non- 
white noise is necessary, even when the classical noise is 6-correlated in space. This 
possibility lies in the additional correlation (201) that we have quantum-mechanically, 
which is equal to (191) in the classical case. 

We have not yet produced the general solution of (27 5a) and the corresponding evalu- 
ation of (27 5b) which, although not entirely straightforward, seem to be completely 
within reach of existing methods. In some cases, Eqs. 27 5a and 27 5b may be directly 
determined from the system Green's function, the information presumably being given 
classically. The connection of this general case with the following special cases will 
be commented upon later. 

4. 2. 2 Markov Case 


In this case the two-time commutators are determined by one-time commutators in 
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the following manner. Once we are assured of the preservation of (194), the commuta- 
tor (271) follows from (234). 

b k (t)<b k (t,) ] = h k (t>t,) ’ t>t ' (276) 

b k (t). b J(t')j = h^t'.t), t' >t. (277) 


Therefore we have to choose 
J^(t) = 2{j^(t)-J2(t)} 


(278) 


to satisfy 

ft oo dT l h k (t ’ T) | 2 J k (T) = 1 ‘ 

When h k (t, t) satisfies (227) it is easy to see that (279) is solved by 
J^(t) = 2 Re A k (t). 


The corresponding two-time field commutator is then 


4< (r, t), 4/^ (r 1 , t') 

mp T op 


G(rt; r 't ' ) 


t > t 1 


= G*(r't';rt) 


t' > t, 


(279) 


(280) 


(2 81 ) 


(282) 


the Green's function of the wave equation (183). It is important to note that with a given 
equation (183) or (206) the conditions (281)— (282) or (276)-(277) are necessary for the 
amplitudes b^(t) to be Markovian. 

In many applications we may find that the real part of A k (t), the dissipative coef- 
ficients, is independent of k. It is then possible to have spatially white driving force 



to deal with strictly spatially non-white noise sources. 


4. 2. 3 Stationary Case 


The noise correlations (256)-(257) in this case are computed from (258)-(259), which 
in turn are directly derived from a large conservative system in which the observ- 
ables {Q k (t), P k (t)} are defined. 10 ^’ 107 The commutation rules (250) are there- 
fore automatically obeyed through (258) and (259). Explicit commutator rules will be 
given in Section I-E. 
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4. 3 Classical Description of Quantum Field Propagation 


The quantum Gaussian field ip 0 p(r»t) that has been discussed thus far is specified 
completely by the mean ( 4 J Q p( r > t)), the covariances 

< 4 op( ? ’ t ) 4 op( F, ’ t ')> (283) 

<^(?,t)4-; p (?',t')), (284) 


and the commutator (189). It follows from (210) that 

[4>(r, t), vp(F t *)] = 0. (285) 

A classical Gaussian field, on the other hand, is completely specified by the mean 
(ip(r, t)) and only the covariances 

<4d(F,t)4d(F',t')) (286) 


(^'V.tH'fr’.t')). (287) 

The difference is that the commutator (189) is zero classically. If, however, an arbi- 
trary function of the operators {ip^(r, t), ip(r, t)} is always written in such a way that these 
operators appear in a chosen order, say, in the normal order, then the commutator ( 189) 
need not be invoked subsequently and our problem is very much like the c -number 
description. This idea of ordering has been substantially exploited. 70-74 We shall indi- 
cate how it can be used to provide a c -number description of our quantum fields. 

Let {^^(t)} be the associated classical amplitudes of {b k (t)}, obtained from normal 

ordering to be specific. Let ^(F, t) be the associated amplitude of 4 1 (r, t), also for 

op 

normal ordering. Then 

i|i(F,t) = Z <f> k (r) P k (t). (288) 

k 

Since the system is linear, the classical and quantum mean equations of motion are iden- 
tical. This implies that ^(r, t) obeys a wave equation with impulse response G(r,t;F't') 
which is the same as that of (237). To characterize ip(F, t), we also need only to know 
(286)-(287), which we can take as (283)— (284). We have thus a classical field ip(r,t) whose 
properties are identical to the normal-ordered ones of ip (r, t). To specify ip Q (r, t) 
from ip(F,t) we just need to know (189). 

Therefore, with the commutator ( 189) given, we can develop the theory of linear 
quantum field propagation in an entirely classical manner as in Section I-B, by inter- 
preting the classical averages as properly ordered quantum averages. Since the sys- 
tem is linear, such a classical description is not much simpler than the q-number 
description. Operator representations that we have discussed will be required in Sec- 
tion I-E. The virtue of a c -number description in our case is that it emphasizes the 
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close relation between a quantum and a classical theory of field transmission through 
a linear system. Since the quasi densities that we use extensively obey classical equa- 
tions we are in effect employing a c -number description in many places. With the 
approach that we have described thus far it is not very important to distinguish 
q-number and c-number descriptions. 

4. 4 Quantum Classical Field Correspondence 

We shall establish the connection that will enable us to write down the quantum field 
specification from a given classical field. (Remember that the only difference between 
quantum and classical fields in our case lies in the commutator (189).) In case this com- 
mutator, like other two-time correlations, is specified by one-time averages through 
the fluctuation-dissipation theorems, we need only establish a correspondence between 
one-time quantum and classical averages. When a unique one-time classical quantum 
transition is set up the complete field correspondence also follows. 

It is worthwhile to emphasize that our aim is to give unique quantum channel specifi- 
cation from given classical specifications. Our quantum treatment is clearly appro- 
priate for quantum channel representation, similarly to the classical case discussed 
in Section I-B. The quantum averages required for the specification, however, may be 
difficult to obtain, depending on individual cases. Therefore we give these quantum 
averages from the classical averages, which need to be given in any case for a classical 
specification. Our development is again conveniently classified in three cases. 

4. 4. 1 General Case 

We consider a general classical channel as described in section 2. 1. 1 (Part I) to be 
given, with amplitudes P k (t) obeying Eq. 16. We first ask what the quantum system 
should be corresponding to a given Eq. 16 and Eqs. 19-21. Some physical assumptions 
will be employed in this connection. 

We assume that the system is linear both classically and quantum -mechanic ally, so 
that the classical and quantum equations of motion have the same form because no 
ordering ambiguity arises. Thus the quantum system will be described by the wave 
equation (183) with differential operators ^ ^ and 2 identical to those of Eq. 1. This 
is equivalent to the assertion that the mean response of both systems is obtained 
through the same Green's function G(rt;r't'). We then have an expansion (193), where 
each b^(t) obeys an equation (206), with h^(t, t) the same impulse response as that cor- 
responding to Eq. 16. Furthermore, the solution b^(t) can be written explicitly as (207), 
corresponding to Eq. 29. 

When f^(t) is Gaussian its third and higher order cumulants all vanish. It is 
therefore reasonable to choose F^(t) so that all higher cumulants of F^.(t) taken in any 
time operator order are also zero. Such observables F^(t) are, according to The- 
orem 6, Gaussian quantum processes. Since the f^(t) are independent for different k, 
we should also choose the FjJt) to be mutually independent. We have therefore a 
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quantum system described precisely as (206) and (200)-(204) corresponding to Eqs. 29 
and 19-21. We still have to establish the correlations (200) -(204) from those of 
Eqs. 20-21. 

Let us first consider the correspondence between one-time variances. Since the 
F^(t) are independently Gaussian, the outputs b^(t) will also be independent Gaussian 
processes, according to Theorems 2 and 3. It follows that the quasi processes cor- 
responding to b^t) have one-time P -distributions given by 



2r 


N, 


Vk 


k P k* (t) P k (t) + 



1289) 


with 


o-^t) =<[b^(t)] 2 > = 



<b^(t) 2 >* 


(290) 


r N, (t) = X^k (291) 

k k 


4 (t) = ( b k t(t,b k (t) >- (292) 

k 


where a prime on the quantity denotes that the mean has been subtracted out and the time 
dependence of the quantities can be conveniently suppressed. Note that the averages 
(290)-(292) in the P -distribution (289) are the normal-ordered averages. We shall com- 
pare (289 ) - ( 29 2 ) with the given classical distribution: 


'(Vk*) - 


27TCT 


k* 

cl 




exp 


cl 


^K k { ) 


❖ 

Pk w 

k* 2 

^cjg 


2r 


cl 


Pk (t) 


-W^k (t) Pk (t)+ -^T 


a cl (r cl 


cl 
(293) 


2 , “j ^ 

a cl (t) = <-Pk (t)2 > = [ a cl (t) J = <Pk>) 2 ) 


(294) 



(t) = <T 


k 

cl 


A 


k* k 
c£°c£ 


(295) 



(t) = <Pk* ( t)Pk ( t)>- 


(296) 


The use of P -distribution in (289) has the distinct feature that the classical limit 
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of (292) is the function (296). In contrast, 



= < b k (t)b k T(t) > 



( 297 ) 


should not be directly related to (296). 

Since the noise sources that we have dealt with thus far are the chaotic noise asso- 
ciated with the filter system, they can be expressed in thermal equilibrium in a special 
Gaussian form 

p n (w) “ ex P 


p k* (t) p k (t) 1 


cl 


( 298 ) 


Such a noise distribution arises from a canonical distribution with 



k B T 

fiicj(k) 


(299) 


where T is the equilibrium temperature. If we wish, we can also let T be k-dependent. 

< 131 

The corresponding quantum canonical distribution 


'(*V b k-‘) 


cc 


exp 


b k t(t) \ (t) 


c£ 


(300) 




suggests the usual replacement of cr^ of (299) by the Bose-Einstein distribution 


n k = 


i/cr 


cj a 


- i 


(301) 


for o-^ . When we allow the system to be in instantaneous equilibrium with a tempera- 
ture T(t) we have the relation 



(t) 


1 


e - l 


(302) 


Such a relation arises even when the same coefficients cr^ enter (299) and (300) because 
operator ordering is involved in going from (300) to a P -representation. 

In the more general case (293) the Gaussian noise distribution admits more gen- 
eral interpretation than equilibrium thermal noise. It can be generally considered 
to be the chaotic noise with distribution function obtained by maximizing the system 
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entropy, subject to the constraints (294) and (296) in addition to normalization. Sim- 
ilarly, the noise density operator 




2 

► exp < 


,2 \ 2 


J 

[H‘- r k KJ 


j 

(303) 


is obtained by maximizing the quantum entropy, subject to normalization and (290)-(292). 
Equation 303 follows from Eq. 116 for a proper normalization constant and with 



* 

’’k^k 


(304) 


In the two maximization problems above it is reasonable to set 


* k* k 

o" i _ c r /< t o* i — o - * 

k c S. k c £ 


(305) 


in the absence of other information. It is also reasonable to assume that the quantum - 
noise energy for mode k is given by 

fico(k) n k 

2 ~k 2 

so that (302) holds between o-^ and cr^ft). With (302) and (305) our one-time corre- 
spondence is complete. ^ 

It is important to observe that the quantum -noise photon number in the general cha- 
otic noise case, although still given by the Bose-Einstein form (302), may not allow the 
interpretation of ^^(t) 2 as in (299) with a time-variant temperature. In such a situa- 
tion we cannot go to the classical limit readily, and (302) may not hold. When the cha- 
otic noise energy distribution is thermal-like, so that (299) holds, our relation (302) 
becomes valid. In the other situations we can assume 



(306) 


without further information. 

Moreover, let us note that our unique one-time correspondence expressed by (305) 
and (302) or (306) is founded on two other assumptions, strictly speaking. The first is 
that the F'^(t) are taken to be Gaussian. It is possible to construct higher cumulants 
of F^ft) which go to zero in the classical limit, in the sense of setting 

fi - 0 (307) 
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or 


A 


n 

1-0 n 5 2 (308) 

etc. 

In this case the structure of the F^(t) is quite arbitrary and not necessarily Gaussian. 
The second is that expressions vanishing in the classical limits as (307) and (308) may 
be present in the right-hand side of (302), (305), and (306). These differences cannot 
be recognized in general, and the correspondence is therefore not unique in this sense. 
In the absence of specific information pertaining to individual problems, there is no way 
to improve our present treatment. Our Gaussian quantum -noise assumption would prob- 
ably be retained in many cases for the sake of analytic simplicity. Furthermore, it 
may be possible to get more unique correspondence by imposing additional physical 
properties on the system, for example, properties of the reservoir. 

Our argument giving the properties of {b^(t), b^(t)} at one time from the classical 
information may be regarded as a way of quantizing linear classical stochastic systems. 
We can adopt an alternative viewpoint in which our quantum field representation as 
described in section 4..1 is granted first. The one-time quantum variances would then 
be compared with the classical variances with the same results as for the other repre- 
sentation. The advantage of the latter approach is that it shows more clearly that our 
developments in section 4. 1 always retain their meaning for quantum -field modeling, 
even in the absence of classical knowledge. The quantum averages can then be deter- 
mined by measurements or by a full quantum calculation, depending on individual prob- 
lems. 

In this general case the output quantum field can be written . 


Hcj 

k B T 


^ (r, t) = /* 

op — 0 


dt' / v dr G(rt; r 't') [E(r ', t') + ^~ nn (r \ t')]. 


OP 


(309) 


This can be compared with the classical given field 


4j(r,t) = J* dt' / dr G(rt;r't , )[E(r't') +J r (r’t')]. 

' o 


(310) 


For a given classical G(rt;F't'), E(r l ,t l ) and (286)-(287) we have our equation (309) 
with the same Green's function and excitation E(r, t). The mean in both cases is there- 
fore the same. Furthermore, from (305) we have 

<4^ p (F,t)4P p (F,t)> =<^p(F,tH; p (F',t)) = <+'(F, tH'(F'.t)), (311) 

and when (306) holds 
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When (302) is valid we have to use a modal expansion 




1 


<+;! <r - t) V r, ’ t| > - f ♦£<?> V’> — rr 

e C£ - 1 


( 313 ) 


k “ 

which may be expanded in a power series in o^. The nature of ^"(r, t) or (r, t) is 
not important, insofar as it gives an additive noise. 

In this general case it is difficult to obtain multi -time correspondence in any unique 
fashion from the one-time results, even when the commutator (189) for ip (r, t) is 
given. When (189) is not given, there is no way to find this commutator in general from 
the given classical information. With the one-time correspondence (311)— (313), in the 
absence of better choices, we can assume 


( 4d(r, t)ip'(r ', t')) = <i|d(r,t)iK(r',t')> t * t' 

< 4/'^(r, t)4j'(r ', t')> =(+'*(?, t)(|»'(r', t')> t * t\ 


(314) 

(315) 


We have given the correspondence directly in terms of ijj^fr, t) rather than op ( r, t) 

because the noise source is usually quite singular. Furthermore, correspondence in 
+ 0 p(r,t) is more directly applicable to given classical additive noise specifications. If 
desired, we can also find the noise-source correlations from that of ^^(r, t). Since they 
are not needed in the present work we shall not pursue them here. 

4. 4. 2 Markov Case 

In the Markov case, the one-time correspondence (311)-(313) establishes a complete 
multi -time correspondence as follows. The commutator (189) is specified directly by 
the given Green's function G(rt;r't') as in (281) -(282). From Theorem 9 the relations 
(311) and (312) can be immediately generalized to read 

<^op (? * t) ^p (F ’ t) > =<+'(?> tM'^r'.t')). (316) 


For the case (302) we can also obtain the quantum covariances from (313) with The- 
orem 9. In this situation both the one-time and two-time quantum averages 
( 4^(r, t)4<(r ', t 1 )) are different from the given classical averages. As the quantum field 
is Gaussian, we have already arrived at a complete correspondence. 

In this Markov case the classical and quantum diffusion coefficients can be com- 
pared more directly. With a given classical ^(t, t) we can obtain D^(t) readily by 
Eq. 60. With the correspondence (302) or (306) we find 4>^(t, t) and D^(t) follows from 
(236). Again we need not be explicit about such relations. 


4. 4. 3 Stationary Case 


In this case the quantum classical correspondence is most easily established. Com- 
plete characterization is given by Theorem 10. It is important to note that in this situa- 
tion the classical specification can be given in terms of G(rt;r't') and only parameter T. 
We have a thermal-equilibrium steady state in general. The classical information is 
presumably given by (266), with n(co) replaced by k^T/Kco. Note that a quantum system 
is Markov or stationary only if its classical limit is also Markov or stationary. 

It is clear that with the prescription described here we can write down a quantum 
field specification corresponding to a given classical specification. Various communi- 
cation configurations can then be formed. They will be treated, together with nondif- 
ferential filters, in Section I-E. 


4. 5 Conclusion 

The basic idea of our approach is quite simple. To establish complete specification 
of a Gaussian quantum field from the classical field, we need to compare 

normal-ordered quantum averages with the given classical averages. One-time aver- 
ages can be compared by using a noise -energy distribution argument. Two-time quan- 
tum averages follow either from the one-time averages when fluctuation-dissipation 
theorems are available, or can be assumed to be the same as the classical averages. 

In any case, the commutator Eq. 189 has to be known. It is important to note that thus 
far only in the Markov or stationary cases have we been able to specify the commuta- 
tor from the given classical information. 

The necessity of introducing an operator noise source is not evident in our quantum 
classical comparison, but will be seen more explicitly later. This is required in gen- 
eral to insure that ^^(r, t) has the proper commutator (189), which in turn is necessary 
for i|< (r, t) to be a valid quantum operator. 

Since it should be clear from Eqs. 27 5a and 27 5b that the two-time field commutator 
is determined by the system Green's function, it is interesting to inquire how the spe- 
cial cases result from imposing additional properties on the general case. In the 
author's opinion, the existence of fluctuation-dissipation theorems is closely connected 
with the conservation of commutators (194). In fact, energy conservation should be the 
basis for both. There should be more general fluctuation -dissipation theorems that 
take particularly simple forms in the Markov or stationary cases. Efforts to seek such 
results are certainly encouraging. Further discussion will be given in Appendix C. 
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E. GENERAL QUANTUM -CHANNEL REPRESENTATION 


We shall give a general prescription for modeling quantum -mechanical communica- 
tion systems, including arbitrary transmitter-receiver configurations, independent addi- 
tive noise, and channel -dependent as well as signal-dependent noises. These factors 
will be studied separately but unified ultimately in a combined representation. A general 
yet simple procedure is then described for converting a given classical communi- 
cation system to a quantum-mechanical representation. Examples of such treatment 
will be given in Section I-F. 

We begin our consideration by generalizing the correspondence of Section I-D to 
channels described by stochastic differential equations. 

5. 1 Quantum Classical Correspondence for Stochastic Channels 

Our stochastic channel is described by a stochastic differential equation 

(£e^se z ) «J» (r, t) = E(r, t) + (F, t) (317) 

with an associated random Green's function 

G R (?t;F't'). (318) 

If we assume an expansion of the form of Eq. 82, our development in Section I-D 
remains largely valid by interpreting h k (t, t) and the relevant quantities as stochastic 
processes. With complete specification of h k (t, t) we can average the equations over 
the channel statistics and obtain whatever channel quantum total averages we want. We 
shall use the term channel statistics for the randomness in G R (rt;r't') to distinguish 
from noise statistics that arise from ^"^(r, t) and other independent additive noise. 

There is an important difference, however, which we will show later in commutator 
conservation of the kind 


[b, b*] = 1 


(319) 


that cannot hold as a stochastic equation in general when h k (t, t) is random. 
Commutation-rule preservation can therefore only take the following form 


[b,b^]=l (320) 

after averaging over the channel. Similarly, the field commutation rule can only be 
interpreted as an average 


V r ’ t), 4 j op <r, ’ t ' ) 


= 2 <j>(r) 4»(r) C(t,t') 


(321) 


b k (t)> bj(t’) 


^kk-Gk^t'). 


(322) 


7 1 



Preservation of such commutators, in particular (320), is again required for quantum- 
mechanical consistency. 

We shall establish, separately for the three cases that we treat, commutator (321) 
conservation and quantum classical field correspondence. Note that when the correla- 
tions of (r, t) are specified together with (318), our development in Section I-D can 
be immediately considered to be a proper description of quantum stochastic field trans- 
mission through a linear system. 

5. 1. 1 General Case 

Commutator preservation in this case can be achieved by solving the nonrandom 
[ F k (t),F k (t,) ] from 

C k (t ' t,) = [ b k (t) ' b k (t,) = dT /-I ds h k (t ’ T) h k (t ’ >s) [ F k (T)> F k (s) • (323) 

In the nonrandom case this amounts to solving an integral equation for a function of two 
independent variables. 

Note that the commutator F k (t),F^(F) so determined is nonrandom and depends 

only on the channel statistics. In the deterministic channel case it may happen that the 

other correlations of F k (t), Eqs. 194-204, depend on h k (t, t). When h^(t, t) becomes 

random we shall have strictly random correlation functions for F k (t). To simplify the 

analysis, we will not consider the case of higher randomness and instead regard the 

correlations of F k (t) as given nonrandom functions. 

The one-time quantum classical correspondence of section 4. 4.1 (Part I) breaks 

down here because the additive noise in this case is the noise source SF (r, t) filtered 

op 

through the random Green's function (318) in such a way that channel statistics plays a 
role. In such a situation it is more convenient to compare the correlations of S r Q ^(r, t) 
and SF (r, t). In the absence of better choices we have to equate the normally ordered 
average of .^^(r, t) to that of ^"(r,t). 

Let us write 


^op (r,t) = -f-oo dt ^ 1/2 dr G k (rt;r't , ){E(r,t) + ^ r o p(r, t)} 

in the "signal plus noise" form 

*|> (r, t) = I 1 dt' / v dr G R (rt; r't') E(r',t') 

p o 2 



(324) 


( 325 ) 
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where 


G^(rt; r't') = G R (rt; r't') - G R (rt; r 't '). 


(326) 


The additive noise 


n °f( r ». t ) = /-co dt ' /y dr G R (rt: r't') &~ nn (r', t 


& 


R 


OP 


(327) 


is still Gaussian, but may be correlated with the signal carrying "noise" 


n E (r,t) = dt' / v dr G^rtjr't') E(r',t'). 
o 2 


(328) 


When G(rt;r't') is again given and normal -order averages of (r, t) are identified 
with those of ^ (r, t), the properties of (r, t) are completely specified in the form 
(325) with (321) and E(r, t) given. 

The most crucial element in setting up a quantum field specification from a given 
classical specification is the derivation of (321) from the classical information. We have 
discussed in Section I-D and in Appendix C how we may be able to employ various 
analyses to derive (321) in general. In the absence of such knowledge, this field com- 
mutator has to be given, or we have to resort to the following cases. 

5. 1. 2 Markov Case 

In the Markov case (276) -(277) remain valid when both sides of the equations are 


interpreted as random processes. In fact, in such a case 


b k (t), bj(t) 


equals unity 


without the need of an average because the random process h k (t, t) was assumed to 
satisfy the deterministic initial condition (Eq. 42). If, however, we leave the commuta- 
tor (189) and (211) random, we shall not be able to establish a photon operator nature 

for functionals of ^ (r, t), as will be evident in section 5. 4. In order not to deal with 

op 

a random commutator it is sufficient for our purpose to require average conservation 
of a commutator like (321). It is then evident that 


b k (t) ’ b k (t ) 


= h k (t,t'), 




b k (t), b^(t’) 


= h k (t',t). 


t > t' 


t' > t 


(329) 

(330) 


and the commutator 


F k (t),F^(f) = 6 kk ,6(t-t') J^(t) 


( 331 ) 


can be chosen to be nonrandom, and hence to satisfy 
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fl„ dr ]h k (t, t) I 2 J k (T) = 1. 


(332) 


(333) 

(334) 

Similarly to the general case, we shall take diffusion coefficients of F^(t) that are 
nonrandom, although possibly dependent on the channel statistics. Moreover, one-time 
quantum classical correspondence will be obtained by equating normal-ordered diffusion 
coefficients to the given classical coefficients. 

It is important to observe that the two-time averages are not as simply related to 
the one-time averages as they are in Theorem 9, which can only be generalized to read, 
say, for t > t', 

(n (r, t)n^ p (r', t')) = / dr " G R (rt; r"t')< n op (r ", t')n^ p (r', t')) (335) 


Furthermore, from (281)-(282), we have 


4*op^ r * t)> 4 J Qp( r *» t') = G R (r, t; r't') 


= G R (rt; r't') 


but not 


<2 (r.t)n^ D (r',t')) = / dr" G R (rt; r"t')< n (?", t*)nj Jr', t')>, 


where 


n (r, t) = / G n (rt;r't') (F,t) dr', 
op xt op 

— T — 

The reason for this is that the one-time average ( n Qp (r, t) n Qp (r, t)) depends also on 
h^(t, t), as is evident from Eq. 235. With a given specification of G(rt;r't') it is pos- 
sible, however, to compute two-time averages from the ' one-time result through 
Eqs. 335 and 235. 

When the classical and quantum diffusion coefficients are identical it is clear that 

the normal-ordered variances of 4 J 0 p(i'> t) are the same as the classical variances before 

channel averaging. From (33 5) it follows that the normal -ordered covariances of 

Jj (r , t) are also the same as the classical covariances. The antinormal covariance 
op 

can be obtained through the normal one by using (333)-(334). It is more important that 
with (333) and (334) and given diffusion coefficients, our quantum Markov field is com- 
pletely specified as (325). In contrast to the general case, it is significant that in this 
situation we know the field commutator from G(rt;r't'). 


5. 1. 3 Stationary Case 

Preservation of commutation rules is a simple matter in the stationary case. The 
noise source (r, t), having a commutator 
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(336) 


.• r op (r,w) ’ jr lp (r, ’ Cj) 


= 2fi Z 4> k (r) <t> k (r')if k (co), 
k 


preserves all of the commutation rules for <^ 0 p(r,t). The correlations of as gi ven 

by Eqs. 74 and 7 5 would be random, however, when •Sfj'(tj) is random. In order not 

K j 

to deal with random correlations we can either assume ^ k (w) to be nonrandom or 
set n(io) = 0, so that we have no classical correlations. 

In any case, the commutator [Q, (0), P, (t)] can be simply evaluated from 


Q k (0),P k (t)] = iR £h k (t) 


[Q k (0),P k (t)] = ifi£h k (t). 


(337) 

(338) 


If we let 


Jr.t) = 2 4>, (F) P,(t), 


op 


(339) 


it follows from (338) that 

[<f op (F, 0), i op (F\ t')] = iR 2 ± G R (rt; F'O) (340) 

so that the field commutator is determined. 

The classical quantum correspondence can be set up in this case by proper identifi- 
cation of the system temperature T. In general we also have 

( n op(F’ “) n o p(?'<“)> = 2HHM Im {-G R (Foo; F'O)} (341) 

( n op (F, cj) n£ (F\ w)) = 2fi{n(u)+l} Im {-G r (Fw, F'O)} (342) 

for the signal -independent noise 

F k (w) 

n (r,«) = Z 4> k (r) — . (343) 

° P k k jsr k («) 

Since the essential difference between quantum and classical fields lies in the pres- 
ence of commutators in the quantum case, we should recognize that Eqs. 333, 334, and 
340 are of paramount importance in our quantum classical correspondence. This will 
become more apparent in our discussion of stochastic signals. 

5. 2 Stochastic Signals 

Consider Eq. 317, and now take the excitation E(F, t) to be a random field. Let the 
mean of E(F, t) be denoted E(F, t), and 
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(344) 


E'(r,t) = E(r, t) - E(r,t). 

Here we use a wavy underline to indicate stochastic signal averaging. We then have 

(STj + ST^) (F.t) = E(F,t) +JT (F,t) + E'(F,t) (345) 

1 C Op av Op 

which differs from (317) in the presence of an additional noise source E'(r,t). As in 
ordinary classical cases, the signal information may enter through this E'(r,t). 

Since E(r, t) is completely classical, it causes no disturbance to the field commutator 
and the quantum classical correspondence. Our previous development can therefore 
include this case in a simple way. Similarly, another classical pure noise source 


^"(r, t) can also be included in (345). 

Consider now the situation in which we have a wave equation 

(JS^+JS^) (F.t) = E(r, t) +Q (F.t) (r,t) + JRF, t), (346) 

where Q (F, t) is also an operator source with completely specified stochastic prop- 
erties. We therefore have an operator stochastic signal. Let us define 

«y F,t, “ V ? ' t)_<Q op (F ’*» (347 > 

E(r,t) = E(r,t) +<Q(r,t)> (348) 

■rsj 

^ (r, t) = JZ"(F,t) + E'(r, t) (349) 

(r, t) = (r, t) + Q' (r, t) (350) 

^ op op op ' 


so that (346) becomes 

/V 

(i^ + i^) tp o (r, t) = E(r, t) + (r, t) + (r, t). 


(351) 


Suppose that we want to retain a given field commutator (321) for ip (r,t), and in 

OP 

particular still wish to conserve this commutator. The commutator of ^“^(r, t) has 
then to be chosen as we chose ^"^(r, t) before. For a given Q Q p(?, t) this is equivalent 
to a choice of the commutator for ^"^(r , t) through (350). In particular, when (r, t) 
or Q op (F, t) are independent 


(r, t), ^ (F.t) 

op op 


# op (F. t).# JpCF. t)] - [<a op (F, t), Qt p (F, t) 


(352) 


When we are restricted to Gaussian quantum noise so that Q^p(F, t) is Gaussian there will 
be no distinction between operator stochastic signals and classical stochastic signals, 
as is evident from (351). Note that (r, t) may now contain signal information 
through Q' (F, t). 
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At this point is is worth noting that a Gaussian quantum noise source (r, t) can 
always be separated into an intrinsic quantum component plus a classical component 


*» 


r,t) +#'(?,!) 

op 


(353) 


with 

‘ijfop! 7 - *»> = ° - <eo P (? ’ *» 

<eo P (? ’"C t op (F ' t » = <^op (F ’ t »^ P (? - t, > 

'-'H 

t )f(F, t)> =(J r op (r, t) ^ op (F, t)X 


(354) 

(355) 

(356) 

(357) 


We are therefore putting the quantum nature, and in particular the commutator of 

SF (r, t), on the new source 3F (r, t), whereas ^"(r, t) carries the correlations that 
op <**■' op _ 

exist classically. It is now more apparent why an operator source Q q (r, t) acts like 
a classical stochastic signal. In our following treatment the separation (353)-(357) has 
other conceptual advantages. 

It is important to note that it is possible to have a prescribed commutator (319) dis- 
turbed by the introduction of an Q Qp (r, t). In such a case we choose the commutator of 
^plr, t) in (346) to get (321), which is then modified by Q Qp (r, t). It is usually rea- 
sonable to retain the t>^(t ) as photon operators, and in particular to retain the commuta- 
tors (333), (334), and (340) for the Markov and stationary cases. The situation is then 
just as described above. 


5. 3 Other Additive Noise and Noise Sources 


We have now discussed both classical and operator noise sources added to the wave 
equation (346). These noise sources can be assumed to be independent, but they give 
rise to dependent additive noises through a random G^(rt;r't'). With properly specified 
statistics the output tj; (r, t) is completely defined. We can write 

4 J op (F. t) = df / y dF' G R (Ft; F't'){E(F\ f) +JRF', t') + ^ op (r\ t')}. (358) 

It is clear that other added classical noise sources can be lumped in the same man- 
ner. Further added quantum noise sources are effectively the same as classical 
noise sources, if we insist on a specified field commutator (321). The commutators 
of different quantum noise source components are therefore unimportant, as in the 
situation in Theorem 5. 

It is possible that the added noise sources are not diagonal in ^(r), or stated 
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differently, do not have spatial modes identical to those of the original noise 

sources. Analysis is more complicated, but the quantum nature of the situation is 
unchanged or known; the commutator (321) is either the same as before or changed 
in a specific way. 

One can also add to (358) other independent additive noises 



+ n(r, t) + n (r, t). (359) 

The classical additive noise n(r, t) clearly causes no trouble. The quantum noise 
n (r, t) may modify (321), however. Again if we prescribe (321), the commutator for 
(r, t) can be chosen to set (321) for arbitrary given n Q p(r, t). Here the situation is 
the same as for the introduction of Q 0 p( r * t). The quantum classical correspondence 
is clear in the stochastic signal cases. When the commutator (321) is specified it is 
also clear in the case of additive noise (359). In particular, the correspondence (313) 
may apply to normal-ordered average of n (r, t) other than (312). 

It is important to note, however, that the additive noise n Q (r, t) or n(r, t) does 
not obey fluctuation -dissipation theorems, which are derived for Hamiltonian sys- 
tems. Such theorems apply only to noise sources filtered through the Green's function 
of a differential equation. The quantum classical correspondence is still complete 
because we know the properties of this filtered noise from the fluctuation -dissipation 
theorems and the properties of additive noise that are given. 

5. 4 Quantum Classical Correspondence for Nondifferential 
Filter Channels 

A general nondifferential system as discussed in section 2. 2 (Part I) can be 
expressed as 

4j (r, t) = / G„(rt; r't'){E(r \ t') + ^”(r ', t')} dr'dt' + n (r,t)+n(r,t) (360) 

op it op 

for a possibly random filter G^(rt;r't') that does not arise from a differential equa- 
tion. The signal E(r,t), which may be stochastic, is a classical field. The additive 
noise contains an operator, as well as a c -number component. The commutator (321) 
is given by that of n (r, t). 

In this case it is clear that quantum classical correspondence can be obtained 
as before with the important difference that the commutator (321), or equivalently that 
of n Q p(r, t), has to be given. When only classical information on (360) is given there 
is no way, as in our general case, to tell (321). Since the commutator (321) is extremely 
important in the quantum representation, it is unfortunate that it cannot be related to 
G R (rt;r't'). In our general case it may still be possible, as discussed before, to 
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relate (321) to G„(rt; r't'). In this nondifferential case, on the contrary, there is no 

x\ 

way to find (321). It is clear that the fields ip (r, t) ultimately have to obey differential 
equations from the laws of physics, and we have to go back to the physical situation in 
which we approximate the system as (360) to see what kind of differential system it may 
come from. This implies that for transition to quantum description the communica- 
tion system should be described from the viewpoint of physical differential equations. 
Further discussion of classical quantum transition will be made later. 


5. 5 Channel Representation for Different Receiver-Transmitter 
Configurations 

We shall now show how different channel models emerge from different types of 
receiver -transmitter configurations. Generality and flexibility of the possibilities are 
discussed, and some simple examples are given. 


5. 5. 1 Theory of Receiver Input Representation 


Suppose that a field ip (r, t) is given in the general form (358) with known (321). Con- 

°P _ 

sider the linear functionals of ip (r, t) 

op 


a = / v Vi drdt w ( r -t) + op ^,t) 


r t 


(361) 


* _ 


r t 


= / v V drdt W'(r, t) pj p (r,t). 


(362) 


We are interested in determining the density operator that gives the outcome probabilities 

for measurements of observables that are functions of a and a^. It is clear that 

(361)-(362) represent completely general linear functionals of p (r, t) and p' (?, t) by 

op op 

various choices of W(r,t) including generalized functions. We can therefore first 
develop a general representation for such (a, a^) and later specialize to different 
measurements corresponding to particular W(r,t).. 

The function W(r, t) and the range of observation {v^, V^} reflect the receiver con- 
figuration in this case. The range 




(363) 


specifies the region of space and time in which we observe the output signal 4* (r, t) of 

the channel. In particular, V gives the physical size of the receiver. The function 
_ r + 

W(r,t) can be chosen for convenience of observation. Note that a and a 1 are in general 

space -time -dependent. When 


W(r,t) 


6(r-r Q ) 6(t-t Q ) 


(364) 


we are observing the field directly. When W(r, t) is constant we are observing the field 
integrated over a given space-time region (363). The transmitter configuration is 
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given by the domain of the excitation E(F, t). We shall eventually discuss the class of 
measurements included in such a description, and also the problem of measurement 
probability calculation for a given observable incorporating the receiver configuration. 

We shall first discuss the actual implementation of the integrals of (361) and (362). 
For a given W(r, t) there may be many physical ways of actually realizing the integrals. 
The essential problem is to determine whether additive noise has been introduced in 
such realizations. For different specific implementations there is no general way to 
tell the nature of the additive noise if it has been introduced. In practical applications 
we have to investigate the actual receiver action on the field. 

There is, however, a rather general approach by which the integrals (361) and (362) 
can be realized and the corresponding additive noise determined. This involves passing 
«|j (r, t) through a matched filter H(rt;r't') defined for a given W(r, t), by 

H(rt; r't') = H(r-r t-t 1 ) 


= 0 


R-r +r' e£ V 
r r 

T-t + t' £ V 

r < r' 

_t < t' 


= W(R-r+r'; T-t+t 1 ) otherwise. (365) 

This filter is space -time -invariant and its output sampled at t = T and r = R is 


a = W(r,t) t|) (r, t) drdt 


(366) 


in the absence of additive noise. Since the filter is zero in the proper region, the a 
of (366) is the same as that of (361). 

For this specific implementation of the integral (36 1)-(362) the minimum noise that 
need be introduced is given by the fluctuation-dissipation-amplification theorems of 
Appendix C. In order that the fluctuation-dissipation-amplification theorems apply, we 
must be able to interpret the filter H(r-r';t-t') as the Green's function of a differen- 
tial equation. Detailed discussion of such attenuation or amplification systems will be 
omitted here. The important point in this connection is that no additive noise need be 
introduced if the integrals ( 36 1 ) -(36 2 ) do not correspond to amplification of i|j (r, t). In 
case amplification is involved, a noise will be added to (36 1)— (362) which is specified 
by Eqs. C. 2, C. 3, and C. 4. Since the a of (361) is related linearly to 4^ (?, t), it 
appears that our fluctuation-amplification theorem provides the limit noise required in 
any implementation of the integrals. 

Generalizing (361)-(362), we therefore write 


a = jy v drdt W(r, t) + op (r, t) + n 


r t 


op 


(367) 
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(368) 


at = ^ V r V t drdt W * (r ’ ^ V (r * ** + "op’ 

where n Qp can vanish, depending on W(r, t). The n Qp , which is independent of t), 

is now assumed to be given. The commutator 


[a, a t] = / drdF'dtdt' W(r, t) W*(r', t 1 ) 4< nn (r, t), 4^ n (r', t ') 


+ [n, n^] (369) 


'op' " " op 

can be calculated from (321), given W(r,t) and [n, n^]. Let us first assume that 


[a, a T ] = 1 


(370) 


so that a can be interpreted as a photon operator averaged over the channel. Suppose, 

first, that the channel is fixed. Since 4* (r,t) and n are Gaussian, we have also 

op * °P 

a Gaussian a, from Theorem 2. Let a and a be the associated classical amplitude 
of a and a^. We can then form the distribution 


P(a,c ) = 


2770*0- /T^ 

a a y 


N 


r 

1 

fa '* 2 

* \ 
2c ' °' r N ,2 \ 

2 (' -r N ) 

L CL 

*2 

\ ff ° 

* + 2 If 

c r tr cr 

a a a j 


/ 


(371) 


where 


a' = a - f v v /drdt£ 00 dt' J v dr' W(r, t) G R (rt;r't') E(r,t) 


J V V. 
r t 


(372) 


2 , ,2 X ( *2 \* 

% =< G )= Va ) 


= fy v W(r,t) W(r',t') dr'dt' <^ p (r,t)^ p (r',.t')> + <n^> 


(373) 


°N = < G '* G '> 


/v r V t W * (r,t) W(r, ’ t,)dr ' dt, < + Ip (r ’ t) V r '’ t,)> +(n op n op > 


(374) 


r N =4 /a l v a 
a a 


4PJF, t) = + nn (F, t) - dt* / v dr 1 G R (rt; r't') E(r', F). 


'op 


op 


R' 


(375) 

(376) 


For a fixed deterministic channel with nonrandom G^(rt;F't'), the distribution (37 1) 
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is the P-representation of the density operator p(a, aA that we seek to determine. The 

reason for this is that a, the linear transformation of ib (r,t), also possesses a com- 

+. °P 

plete set of eigenstates. Together with [a, a'] = 1, it is sufficient for 


p(a,aA = / d 2 a |a) (o| P(a, O. 

An explicit demonstration is given in Appendix G. 

When G^(rt;r't') becomes random the condition (370) allows us to interpret 


(377) 


P(a )C * = f P(«,«*)p(j4 n ) 


k 

smn 


(378) 


as the P-representation for the channel-averaged density operator p(a, a') describing 
measurement probabilities of functions of (a, a A derived from a random channel. In 
Eq. 381 we shall assume that the random filter G^(rt; r't') is given in the form of 
Eq. 84 with a joint distribution p(g|An) for the random coefficients {g^ n }- 
Similarly, for a set {k}, we have 


a k = V v. % (r ’ t} w k (r ’ t} drdt + n op’ 


r t 


op 


(379) 


with 


a k’ a k' 


kk' 


[ a k > a k ,] = 0. 


(380) 

(381) 


For a fixed channel the joint P -distribution P(a, a) for the associated classical ampli- 
tudes { a k >' a k J can de directly calculated for any order among the a^, as they are jointly 
Gaussian. The parameters in P(o, a’') are determined from the statistics of i|j (r, t) and 
n . We summarize this in the following theorem. 

Theorem 11 

The distribution 


— * 
P(o, a ) 


/ P (c, a*) p(g^ n ) dg; 


k 

mn 


(382) 


is the P-representation of the density operator describing measurements of observables 
that are functions of a, a A where a = {a^.} denotes the set of operators {a k } of (379)- 
(381) collectively, and a_ is the associated amplitudes of a. 

Note that according to Theorem 11, the density operator p(a, a A can be constructed 
only from given statistical specification of ib (r, t) in the normal order, together with 
the commutator (321). See Appendix G for a more detailed elaboration of this possibility. 
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When the a^ do not commute for different k. 


[ a k’ a k'l * °> 


k * k' 


a k > a k . 


* 0 , 


k * k' 


t * 

the density operator p(a, a') may not be simply related to the P(a, a_ ) calculated in this 
manner. In this case a specification of (r, t) in terms of its modal amplitude den- 
sity operators will be necessary. (See Appendix G.) 

+ 

It may turn out that we have operators (a, a'), as given by (367) and (368), which 
have 


[a, a] * 1 


(383) 


[a,at]*l. (384) 

In such a case we cannot interpret a distribution like (371) as the P -representation of 
p(a, a^), but proper scaling of the variables a to insure (370) can be achieved, since 
(383)-(384) are c-numbers. The resulting P -distribution so constructed would in gen- 
eral be quite different from (371). The case of many operators a_ can be handled sim- 
ilarly. 

It is now clear that the commutator (321) is needed for construction of the density- 

+ # 

operator representation p(a, a'). The specific form of P(a, a ) is influenced greatly by 
different commutators (321). 

Let us now consider what class of measurements has been included in (384). Con- 
sider measurement of an observable 




(r,t),4^ trt 

op ^op 


(r, t)) 


i r* 


which is an arbitrary function of ip o p(r, t) and ^(r, t). The receiver configuration is 


now 


built into the form of 8 fib , V When 6 (^b , di^ ^ is a nonlinear function of 

\ op op/ \ op op / 


(r,t) a function W(r, t) may not exist such that 8 
op + 

terms of a and a only. With 


(%’^op) 


can be expressed in 


\ / , 4 - \ 

a set a^ of (384) the observables 0(+ O p> 4^ j can 
be expressed, under some broad conditions, in terms of a^ through 4^ » when 4* 0 p is 

expandable in terms of a, . In this case we can form density-operator representations 

k t 

for the a^ and then calculate the measurement probability for 6(a, a'). This pro- 
cedure is inconvenient, and simpler methods may be available depending on par- 


ticular 8 


(4* , 4^ ^ 

\ op ^op / 


and the statistics of 4'(r, t). 


We are unable to develop a simple 


convenient procedure comparable to the one above which applies when the receiver 
structure is reflected in the W^r, t). 

To illustrate the situation further, let us consider the following energy measurement 
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which is frequently employed in practice. Here we make a direct measurement of N, 
where 


N 


/ i|i| (r,t) 4> (r, t) W(r, t) drdt. 


This variable N cannot be readily expressed in terms of a and a^ of ( 36 1) -(362) in 
general. When we expand i|j (F, t) in terms of a. , N can also be given in terms of a. 

Op K a. 

but the resulting probability calculation for measurement from p(a, a 1 ) is inconvenient. 
In the presence of other information, for example, when i|r (F, t) ^^(r, t) has Poisson- 
distributed eigenvalues, we can make a direct calculation of measurement probabilities 
without using p(a.a^). 

We wish to point out here that under general conditions we can always find a set {a^} 
describing the field vj; (Ft) completely in a convenient manner. That is, there exists 
a canonical representation for the receiver input field 


4* (r, t) = 2 a, 4>,(r, t), (385) 

op k K 

where 4> k (F, t) is proportional to the eigenfunction of a linear integral equation whose 
kernel is 

[+op (ft) *^p (?, ‘ t,) . 

so that (380) and (381) hold. When the joint variances of {a k } factorize in any order, 

we have a set of independent quantum observables {a^} which specifies 4 J 0 p(?» t) in much 

the same way as the coefficients in a Karhunen-Lofeve expansion of a classical random 

field. In particular, the density operators of each mode a^, which play the role of 

probability densities for the coefficients in the classical case, can be calculated in the 

way described above. Any set {a^} that does not obey (380) can still be considered 

as a linear combination of the {a^} in (385) so that the representation applies to any 

receiver configuration. This canonical quantum description thus parallels the "covari- 

12 

ance function -impulse response" type, which is a common approach to classical 
detection and estimation problems, although our system is described in a "state - 
variable -differential -equation" approach. 

As the transmitter configuration is contained entirely in the form of the excitation 
E(F, t), our complete system representation 

+ op (r.t) = / G R (?t;F't')[E(F 1 ,t I )+^ 0 p(F',t 1 )] dF'dt' + . . . 

fully parallels the ordinary classical channel description. Note that an explicit physi- 
cal field description in the H -picture is required for such parallelism. 

Our procedure makes it evident that we can form a variety of density operator repre - 
sentations for different types of measurements. In general, any receiver configuration 
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can be treated at the expense of complications in analysis. This important feature makes 
it possible for us to study the optimal receiver measurement for any given receiver 
size. Before studying these questions we want first to give some simple examples 
illustrating our procedure. 

5. 5. 2 Examples 

Let us first consider the case 

a k = b k (t) = V ^op (F,t) (386) 


Each b^(t) has a P -distribution of the form 





2 r 

*2 N 2 

P4 % * p k 

*2 * P k P k + 2 

X XX X 




with 


X i ([ b k (t| ' l ^ (t,T)e ^ )dT ]) s (x) 

o-^j = <^b£(t)- / h*(t, t) e*(T)d-rj j\> k (t) - / h k (t, t) e k (T) d-rj 


(387) 


(388) 


(389) 



(390) 


The total density operator is 

P (b,b t ) = n Pk = j/n P k (p k .Pk. t ). 

k k 


(391) 


This density operator describes one-time measurement of observables that are functions 
of t> k (t). Since (^(r) is defined in the spatial region from the transmitter to the receiver, 
it is clear that such h> k (t) would never be really measured, and we should find 
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density-operator representations for more realistic measurements. 

For this purpose, we assume that 

<+op< ?,t )-> = ° r ^ V^(t) (392) 

and expand 

«l» (r,t) = Z 6 k (?) a k (t) ? e V 2 (393) 

k 

= ^ p (?.t) F ^ V 2- (394) 

The spatial region V£(t) for which the mean output field is nonzero may be time- 
dependent. We also assume, for simplicity, that e k (F) are real orthonormal functions 
over V^(t) so that 0^( r- ) actually also carries a time dependence. We have 

a k (t) = h* V ?,t) 8 k (F) dF (395) 


so that 

<a k (t)> = / v , 0 k (F) dF f v dF' /q dt' G(Ft; F't') E(F', t') (396) 

< & k\\> = f V' 2 dFdF ' G k 1 (F) 9 k 2 (F,) \ (?) \ (r ' ,)<b k3 (t)b k 4 (t,) 

= (b'^(t)b'(t)) 2 / yi dFdF' e k (F) e k (F 1 ) 4>* (F) <t> k (F') 

k 2 12 3 3 

= 6 k k (b'W'ft)). (397) 

We have made the approximation 

< a k t(t)a k (t) > = 6 kk .< b ' t(t)b(t) >. (398) 

or equivalently 

h k (t, t) = h(t, t) independent of k. 

Similarly, it is easy to show that 


[a k i(t). a£(t)] = 6 kk , 


(399) 
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[a k ,(t),a k (t)j = 0 (400) 

( a k (t)a k’ (t) ) = 6 kk'^ b k (t)b k' (t) >- (401) 


It is clear that a k (t) is the photon operator t> k (t) restricted to the spatial region V^(t). 

A density operator p(a, a^) then results which describes measurement of b k (t) in the 

interval V^(t). We shall not write down the explicit form here, since it is a product of 

Gaussians. Note that since V' (t) can be of measurable size, we have a description 

^ t 

that is more realistic than the previous p(b, b'). 

It is worthwhile to observe that with the approximation (398) we can make a random 

variable transformation to express the product distribution II I\(Pk’ ^k’ *) of (391) in 

k 

terms of {g^, g^}, the associated classical amplitudes of {a k , a^}. The resulting dis- 


tribution differs from the one corresponding to {g^ a k } by a factor 


exp 


”1 /- vi dr 4^ (r, t) <|> 1 i|j C (r, t) 

2 r ^ V' -op Z -op 


(402) 


which corresponds to the field (r, t) outside V),(t). 

In these examples the total density operator is an infinite product of many compo- 
nent density operators and is therefore quite untractable. Assume that we have a situ- 
ation of digital communication with a total number M of messages {i}. For a specific 
set of input excitation E^(r, t) the mean output 

M 

<^ (r,t))=2 C n <a n (i)> e n (r,t) i = 1, . . . , M (403) 

p n=l 


can be expanded in terms of only M orthonormal functions {£ n (r,t)}. The signal infor- 
mation can then be obtained by observing functions of an infinite set 


= — / 
C 1 
n 


V r,t) 


£ (r, t) drdt 


n = 1, . . . , M. 


(404) 


By proper choice of the i -independent constants C , we can have 



n = 1, . . . ,M 


(40 5) 


so that a joint P -distribution for M amplitudes { c n l can be readily found. The resulting 
density operator should be much simpler than the operator like (391). Employing (if 
possible) Karhunen-Lofeve expansions of the type given in Section I-C, we can construct 
other simple receiver input representations. We shall not give the details of such a 
development here. 

We will now consider the following question: Which 'optimal' measurement is more 
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optimum, the one derived from (391) or the one based on (404)? 

5. 6 Relative Optimality of Different Receiver Configurations 

The optimality that we are talking about is that of communication system perfor- 
mance. We use criteria that are functions of the receiver input density operators and 
the message statistics. We shall first establish a means of judging whether a loss of 
optimality has occurred for a given receiver configuration. This criterion will be a 
quantum -mechanical version of the theorem of irrelevance^ for density operators. 

g v 

For this purpose, let be the total channel output density operator corresponding 
to the signals with subsystem density operators 


= tr z 4 (406) 

P^ = tr lP* (407) 

which describe measurements of observables X ^ and X 2 °f subsystems 1 and 2, respec- 
tively. As in the classical case, we want to find the condition under which measurements 
of any subsystem 2 observables furnish no further information about the signal S, 
given that any measurement of Xj has been made. For this to hold, we must demand 
that the conditional probability of measured x 2 given measured x^ for input signal S, 

p s (x 2 l X 1 (408) 


be independent of S for all subsystem observables Xj and X 2 - 
can be computed straightforwardly 

< x ll< x 2 l P^l x 2> K> 

< x ll P+ l x l> 

If we also define a conditional quasi-density P g IPfPi ) by 

P* = / p ,(p 2 P2>PiPl) IW <P 2 P. 1 I <jV 2 p, 
pJj = iPi> d2 pi 


P S (X 2 l X l } = 


The probability (408) 

(409) 

(410) 

(411) 


/ * * \ 




( 412 ) 


Eq. 409 can be written 
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( 413 ) 


1 p!(mO P s(^2I |3 1 I 3 D|< !I iI | 3 1>| 2 k x 2 IP 2 > I 2 

p s (x 2 l x i ) = ; — 1 

/ P?(PjPD l< x ilPi)| 2d2 Pi 

Unfortunately, we see that the stipulation that 

Ps^lPlO 


(414) 


be independent of S is not sufficient for (408) to be independent of S for all Xj and 
X 2> A sufficient condition is 

P S (V2^1^) = P ^D 


( 415 ) 


for which we can state the following theorem. 
Theorem 12 


If the subsystems 1 and 2 are normal-order -independent; that is, 

p s (wp i O = p ?( |, i p D p z(^^)' 


(416) 


then subsystem 2 can be ignored for signal processing without loss of optimality. 

We conjecture from (413) that (416) is also a necessary condition for subsystem 2 
to be neglected. This is in direct contrast to the classical case, and the difference is 
clear from (413) because we demand that any measurement of subsystem 1 make mea- 
surements of subsystem 2 fruitless, which is quite a strong condition. When Xj and X 2 
are specified, however, the problem becomes completely classical. Alternatively, we 

can also ask the question for fixed Xj but variable X 2 - There is no easy solution other 

s s 

than the condition (416). Note that (416) includes the particular case p, = p, <g> p , • 
Let us consider a given receiver configuration 


{W, (r, t)} 


{v r .v t } 

such that {w,(r, t)} is complete in the interval (363). We can express 

K 


V r ’ t) = 4 j op (r ’ t) + ,|j op (rit) ’ 


(417) 


(418) 


where 
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V r,t) = 4 'op (r - t) = 

V ?,t) = 4j op (?>t) 


2 a k W k (F,t) 
k 

fr ^ V 

It ^ V 


f rev 
It e v 


(419) 


(420) 


The signal dependence goes only to 4 j S (r,t). By Theorem 12, we have not lost optimality 

+ g 

by measuring functions of (a, a 1 ) if the random field ^ 0 p( r >t) is normal -order indepen- 
dent of the field ip^pfr, t). In our case i|j^p(r, t) is usually the additive noise so that for 

£ _ 

deterministic channels ip (r,t) can be ignored if the noise field in (V^, V, ) is normal- 

op jt l 

order independent of its other part outside. In particular, if the additive noise is white 
in both space and time, we need only look at the portion of the system containing 
the signal. In the presence of non-white additive noise or when ^ 0 p( r > t) and ip 0 p(r, t) 
are correlated in the normal order we shall be able to improve our performance, in 
principle, by observing ip 0 p( r >t). In this case ip D p(r,t) cannot be ignored by an optimum 
receiver. Note that optimality may also be degraded when the integral of ip (r, t) over 
W^(r, t) introduces additive noise. 

It is important to observe that because of preservation of commutators like (321) 
and (322) the additive noise component of ip Q p(r,t) cannot be completely white either spa- 
tially or temporally. Nevertheless, the additives can be normal-ordered white, in that 
the normal-order correlations are 6-correlated. In such a case there will be no loss 
of optimality by observing the field in a restricted space -time region. Intuitively this 
seems so, since the antinormal correlations, although non-white, contain no addi- 
tional information other than the commutator that we already know. 

Since we may have to be restricted to measurement intervals (363), for various rea- 
sons, it is more appropriate to ask whether in an expansion of the form 

M 

ip (r,t) = 2 C a | (r,t) + ip C (r, t), 

Y op j n n n T op 


rev 


t e v. 


(421) 


the optimal observation based on (a, af) suffers any possible performance degradation. 
When the additive noise is non-white it is clear that 4 / 0 p(r, t) of (421) cannot be ignored 
by the optimum receiver. In this situation the additive noise is frequently not "time- 
white. " 

In the example (386) measurement of simple time observables based on {b k (t)} 
entails no loss of optimality if we are constrained to make one-time measurements, 
although further observations would be desirable. No loss is attributed to space non- 
whiteness, since we are observing the total spatial volume under consideration. In 
the example following (393) the optimality is the same as in the previous case, even 
though we are observing a smaller spatial region. This is because under the approxi- 
mation (398) the additive noise is spatially white in normal order. 
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Although unknown loss of optimality occurs in situations like (421), the loss is likely 
to be small. The simplification in system analysis, design, and implementation 
resulting from (421) would probably favor its use rather than strictly optimal represen- 
tations. In conclusion, let us note that Theorem 12 cannot give the quantitative dif- 
ferences that may exist between two sub-optimal receiver configurations. It seems 
that there is no general way to obtain these differences except by individual optimal 
evaluation. 

5. 7 Complete Channel Representation 

We can represent our channel and the corresponding receiver input density operator 

as in Fig. 4. As we have shown, the system can be simplified for Gaussian quantum 

noise to the form shown in Fig. 5. We shall now summarize and discuss the general 

+ 

procedure for establishing p(a, a'). 

We first emphasize that in the absence of given classical information our linear 
quantum -channel characterization provides the general framework for the develop- 
ment of p(a, a^) from the transmitter channel receiver characteristics as outlined in 
Fig. 5. Parametrization of such characteristics has to be obtained from calculations 
or measurements for each individual problem, by using the approach that we have out- 
lined. The specific nature of a problem may be invoked to find the field commuta- 
tor (321) when fluctuation-dissipation theorems are not available, although this may not 
be easy. 

It is most desirable to ignore the specific nature of a problem and to obtain the 
quantum specification directly from a given classical specification with a prescribed 
procedure. Such a procedure can then be applied without detailed knowledge of quantum 
theory. We shall now show how such a procedure may result from our development 
when the commutator can be determined from the given classical information. 

1. The only essential difference between the quantum and classical cases lies in 
the commutator (321) which for Markov or stationary systems can be obtained from 
Eqs. 333, 334, or 340. 

2. Separating out this commutator, or the corresponding one for (r, t) in Fig. 5 
as in Eqs. 353-357, we have left a basically classical wave field. The normal -ordered 
averages of these fields may be identified with the given classical information. When 
appropriate, modifications like Eq. 313 may be introduced. 

3. Stochastic channels and signals can now be described in a classical manner. 

4. Form the observables (Eq. 379) for a given receiver configuration and find the 
P -representation by making sure that Eq. 380 holds. 

We summarize this procedure in the following formula. Let p(y, y ) be the distribu- 
tion describing the stochastic signals characterized by the random variables {y, y }. 

Assume that {a, a^} is given as in (379) so that (380) holds. Let the corresponding 
_ # 
classical variables be denoted (a, a ) so that 
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transmitter receiver 
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Fig. 5. Reduced representation of quantum communication systems. 








( 422 ) 


a k = f v V ^ r ’ ^ W k* r ’ ^ drdt + n k 
r t 

for the given classical field t) and noise n^ associated with W^(r, t). The noise n^ 
can be assumed to be Gaussian. The given classical information can then be used to 
calculate 


P(g, g ) = / p(a, a ; r, r ; g ) p(r, r ) p(g ) d rdg 
w — — . - — — & mn / K — - r \ 5 mn/ - Smn 


(423) 


/ * * k \ * 

where p(a, £ ; r , r ; g 1 is the Gaussian distribution for (g, g ) when the channel and 

signal are deterministic. We now have the following theorem. 


Theorem 13 

The density operator p(a, a^) can be represented in a P -distribution given by (422). 

The only important requirement for applying Theorem 13 is that the field commu- 
tator (321) is needed in general to find out the commutator for {a, a^}. The specific 
+ — * • - 

form of p(a, a 1 ) or ^P(g, a ) depends heavily on such commutators. On the other hand, 
we are not yet able to obtain (321) from the given classical information, except for 
the special cases of Markov and stationary systems. 

An even more important obstacle in applying Theorem 13 is that the usual classical 
specification is not given by a differential equation description. For an arbitrary spe- 
cified random filter we may not be able to interpret it as the Green's function of 
a differential equation. In general, there are consistency requirements that arise from 
the deterministic initial and boundary conditions. For our Markov and stationary cases 
the consistency requirement is even more severe. The difficulty is actually a classical 
one, of finding G-^(rt;f't') corresponding to a differential equation, which comes up 
unavoidably in our quantum situations. These points will be considered further. 

Note that if we are restricted to one-time measurements, we do not need two-time 
commutators. Since the one-time commutator is always known, the difficulty discussed 
above does not appear. Equation 321 is still needed, however, for complete specifica- 
tion of the quantum situation. The relation of our procedure to an ordinary description 
can be carried out as in section 2. 6 (Part I). In our correspondence we have to deal 
with the complete fields, however. 

5. 8 Conclusion 

We have discussed various points in connection with the development of quantum 
communication system models. They can be properly unified as in Fig. 5. When 
applicable, the procedure that we prescribe for the quantum classical transition is 
quite simple and can yield a variety of different representations. 

It is possible to put each individual quantum problem into our framework in an 
approximate fashion. The task is reduced to a classical development of a proper 
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wave equation and the determination of its corresponding G^(rt; r't'). While the field 
commutator Eq. 321 is crucial for a general quantum specification, and is not yet 
available in our general case, we feel that it should be possible to find it in general. 
In any case, a differential equation viewpoint is necessary for finding such commu- 
tators, and hence for our quantum classical correspondence. Alternatively, what 
we actually need is a procedure for canonical quantization of nonconservative linear 
stochastic systems. 
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F. APPLICATION TO OPTICAL CHANNELS 


We shall illustrate by some concrete examples the theory of quantum- channel repre- 
sentation that has just been described. We shall concentrate on the case in which a first- 
principle analysis is not available and only classical information is specified. Our 
purpose is to illustrate general procedures rather than to present detailed results. We 
shall treat one case in which a detailed analysis can be carried out from a basic physi- 
cal description of the situation. 

6. 1 Consistency Conditions for the Classical Quantum Transition 

We shall first elaborate upon the application of our correspondence procedure to a 
given classical field that has already been briefly described. It is clear that the develop- 
ment of a density-operator channel representation from our prescription is straight- 
forward in principle, although it may be difficult analytically. The important problem is 
the establishment of the' commutator Eq. 3 21 from the given classical information. Let 
us first observe more carefully the significance of this commutator. 

Aside from being a requirement for complete quantum specification of the field under 

consideration, the commutator Eq. 321 has to be explicitly invoked in determining the 

commutators Eqs. 3 80 and 381. The extent to which Eqs. 3 80-381 determine the form 

of the final density-operator representation can be seen as follows. When the resulting 

p"(a, a^) for the set {a, } of Eq. 379 is Gaussian the effect of Eq. 321 is simply a scaling 

— 1 * 

in the parameters of p(a, a ) or of Eq. 382. Without a precise knowledge of (321) the 
scaling effect cannot be determined. Although the operator form of p(a, a^) is the same 
regardless of the form of (321), we shall still not be able to determine the quantitative 
dependence of our results on the system parameters. Such a situation is clearly not 
acceptable. Furthermore, when p(a, a^) is not Gaussian its operator form, or its 
P- representation (Eq. 352), cannot be determined properly without the specific com- 
mutators Eqs. 380 and 381. The commutator (321) is therefore necessary for general 
{a^}. The only exception is when 

W k (F,t) cc 6(t-t Q ) 


in Eq. 


379. In this case only the one-time commutator 
^(r.t), ‘Hop^’ ’ = 


(424) 


or 

[«r op (?,t),<f op (? , ,t)] = in 6(?-P), (425) 

is needed. 

While we feel that it should be possible to find (321) in general from a classical 


95 



differential equation description of the channel, we have obtained results only in the 
Markov (or the vector Markov component) and stationary cases. We shall now investi- 
gate in these situations the applicability of our Eqs. 333, 334, and 340 to a classi- 
cally specified G^(rt;r't'). 

Let us assume that the given Gp(rt;r't') arises from a differential equation, and then 
determine what consistency conditions it has to satisfy. For Markov and stationary sys- 
tems the output fields have a particular structure that leads to consistency conditions 
for both the classical and the quantum fields. In the Markov case the conditions are 
the fluctuation-dissipation theorems Eqs. 33 5 and 3 70 averaged over channel statistics. 
The channel-averaged quantities in these relations are very difficult to compute and 
depend heavily on the individual random G^(rt;r't' ). It is not clear what they would 
imply about the structure of G^(rt;r't' ) in general. It is therefore appropriate to ignore 
these relations under the assumption that the processes are describable in a Markov or 
vector Markov way, at least as a first approximation. There still remain consistency 
conditions that arise from the deterministic initial and boundary conditions. 

Spatial boundary conditions of the problem are presumed to have been incorporated 
in the mode functions ^(r) in the expansion (Eq. 193) for + (?, t). Initial conditions 

give rise to the constraint (424) at equal time so that 

G^(rt;r't) = 6(r-r') (426) 

in the Markov case. From Eq. B. 11, the equal-time constraint on G (rt;r't) for the 

K 

vector Markov case is 
H n-1 

r G R (rt;r'T) = 6(r-r') (427) 

dt n_1 R t=x + 

t h 

when i involves derivatives up to the n order. It is clear that condition (426) pre- 
cludes interpreting a classical random filter with response function 

G d (rt;r't) = 0 (428) 

as the Green's function of a Markov differential system. 

Assuming G n (rt;r't') to be mean-square differentiable we can interchange differen- 
tiation and expectation operations so that (427) becomes 

,n- 1 _ _ 

— — [ G(rt;r'r) = 6(r-r'). (429) 

dt + 

This is inconsistent with (428). In any case, (426) or (429) becomes the necessary and 
sufficient condition for interpreting a given G^(rt;r't') as a random Green's function for 
Markov or vector Markov systems. In this case the field commutator (321) is 
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given by Eqs. 333, 334 or B. 10. 

In the stationary case the fluctuation-dissipation relation (Eq. 341) can be readily 
interpreted and puts a rather severe requirement on the correlations of the additive 
noise. Initial conditions, or (42 5), also require 


5t G R (rt;r'o) 


= 6(r-r.' ) 


t— T , 


(430) 


which has to hold in addition to Eq. 341. While we do not know the commutator (321) 
in the general case, we can see from Eq. 24 that we also have 


9t 


—J G R (rt;r't' ) 


t=t'. 


6(r-r' ) 


which is identical to (429). This condition arises from the constraint (42 5) at equal 
times. It is now clear that (429) is in general a necessary requirement for G„(rt;r't' ) 
to be the random Green's function of a differential equation corresponding to ^^(r, t) 

in Eq. 193. Depending on the nature of the fields, for example, 4* (r, t) or $ (r, t), 

^P 

the corresponding G^(rt;r't') would be different functions. Since t) and the elec- 

tromagnetic fields are related deterministically by linear operations, so are their cor- 
responding Green's functions. Explicit relations between such Green's functions can 
be obtained in general as in Appendix F. It suffices to note that the vanishing of one 
G R (rt;r't') implies the vanishing of all others from linearity. 

In summary, when (428) holds for a given random filter G^(rt;r't') we cannot strictly 
interpret it as the random Green's function of a differential equation. On the other hand, 
the commutator Eq. 189 can be immediately written when we are willing to accept the 
Markov assumption and, also, when (426) or (429) is satisfied. In the stationary case 
the additive noise correlations and G D (rt;r't') have further to obey Eq. 341 in addition 

I\ 

to the condition (43 0). 


6.2 Further Considerations 

To sharpen the discussion, let us consider a given random Green's function expanded 
in the form 


G f (rt;r't')= 2 g kn 4> k (r) \(r') y (t) y*(t'), 
kn 


(43 1) 


where ^(r) and y fi (t) are complete and orthonormal without weighting functions, and 
{gk n } a re random variables with given joint distributions. It is clear then that 


G f (rt;r't')= 2 g k n \ (r) ^k ^' ) y n (t) y n (t ' ) 
kn 


(432) 
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so that if Gj(rt;r't') is nonvanishing in some space-time intervals the averages of {g^} 
cannot all vanish. When t = t 1 it is reasonable to expect nonvanishing G^(rt;r't') for a 
differential system, since the channel disturbance has not yet begun to develop. While 
it is possible to expect G^rtjr't') = 0 for t » t', our expansion of the form (432) may not 
be consistent with such situations. 

It is possible to account for these situations in a phenomenological manner. Let 
G (rt;r't ') = A(r, t) G f (rt;r't') (433) 

be the new random Green's function. We assume that A(r, t) is a given classical random 
field when 

t » t' 

r » r' 

but is unity otherwise. The precise region of random A(r, t) can be specified depending 
on the individual problem. It is therefore possible, by considering (433) as the random 
Green's function, to satisfy the initial conditions and at the same time have the desired 
behavior at the channel output. The function 

G R (rt;r't') = A(r, t) G f (rt;r't') / (434) 

then enters into the commutator instead of G^(rt;r't'). The expression (434) gains further 
significance from the observation that for a multiplicative system 

^op (? ’ t) = A(? ’ t) ' ^ 

The transmitted field (r, t) is usually related to the source by a possibly random 
Green's function G^rtjr't'). A representation of the form (433) can therefore be regarded 
as quite satisfactory when the region of random A(r, t) is properly determined. 

Randomness in the output can also be attributed to the stochastic nature of the signals 
in the following way. Let a random Green's function of the form (431) be given. We 
write 



(r,t) 



G R (rt;r't') E(r',t') dr'dt' + n (r, t). 


(435) 


where we suppose that the commutator of n (r, t) is given by (426). Let us assume {g, } 

O p KTX 

to be independent so that we have a canonical diversity representation 


l *op (r,t)= 2 g kn'* > k (?) y n (t) ^ y n (t,) < * > k (r ' ) E(r '’ V) d? ' dt ' + n op 
kn 


(r, t). 


(436) 


Define 
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V ? ' t) = ^ V ?)y n (t) e kn g kn 


E(r, t) = 2 e,<j> (r) y (t) 
kn 




op 
We have 

a . 

kn 


n - (r ’t)= 2 g kn f k n V ?) y n (t) - 


kn 


r 1 - J 4>*(r) y*(t) * (r, t) drdt = ^ 

rf 1 rr 


’kn 


=kn 


‘W a k'n' 


6 kk' 6 nn' ' 


(437) 


(43 8) 


For independent Gaussian f^ we can immediately form 

P( £ , a*) = n P kn ( Qkn . a * n ) 


kn 


P km(“kn’ “kn) - exp 
nn kn 


“kn e kn g kn/ g kn' 


(439) 


(440) 


n. 


kn 


where £ = {° kn } is the associated classical amplitude of ( a kn }> and we have assumed 


^ f kn f kn^ = n kn 


<fj f, ) = 0. 

' kn kn' 


(441) 


(442) 


It can be seen from (440) that the randomness in g^ n may alternatively be introduced 
through e kn - Assume 


/ * \ 1 I 1 "kn 1 

p ( e kn' e kn/ = iT "P 1 T 


kn ^ kn 
The averaged P-function for a becomes 




(443) 


|2 


- / * \ 1 1 kn 1 

P kn\kn’ “kn) " Z” 6Xp 1 " - .=■ 

\n kn 


7r(n, + N ) 
kn kn 




(444) 


where we have assumed for simplicity that g is now chosen to be nonrandom. The 

Kn 
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form (444) is the familiar one of Gaussian signal in Gaussian noise. Note that we can- 
not assume {g^ n } to possess a distribution like (443), since in that case g kn = 0. 

Random-phase channels can be introduced through signals similar to those in the 
example above. While such a procedure is acceptable analytically, we feel nevertheless 
that it is more meaningful and correct to treat random channels by Green' s functions 
like (433), since the commutator (321) does reflect the channel mean response. This 
will be discussed further. 

6.3 Radiative Loss. and Dissipative Channels 

We wish to show the differences and similarities between quantum- channel repre- 
sentations of the radiative loss and dissipative channels. The radiative loss case is an 
example of a free- space optical channel. Classically, both channels may be regarded 
as additive noise that is free under appropriate conditions. It is interesting to find out 
whether they are also similar quantum- mechanically. 

6.3.1 Radiative Loss Channel 

Consider a field (r, t) at the channel output resulting from free-space transmis- 


sion. The expansion (Eq. 193) becomes 

-iw t 

+ op (r, t) = 2 cf> k (F) e b k (0), (445) 

k. 

where 

b k (0),bj,(0) = 6 kk , ‘ (446) 

and all other commutators are taken to be zero. The observables t> k (0) correspond to 
pure coherent states. 

< b k (0 ) >=P k (447) 

<b£(0)b k (0)> = p*p k (448) 

<b£(0)b£(0)> = 0 (449) 

<b k ( 0)b k (0) ) = °- (450) 

If {p } are the associated classical amplitudes of b (0), the P-distribution of b (t) is 
K K K. 

given by 

p k(v*»’ <«>) - 62 K - e 1 ” kt \) • < 451 > 
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The field commutator (321) is then 


V r ’ t), 4 i op {r '’ t ' 


* -i“Y. (t-t 1 ) 

= 2 V ?) v ?) e 

k * k • 


which is the free- space Green's function 


G F (Ft;r't') = G F (?-r';t-f). 


(452) 


(453) 


The observables (Eq. 379) can be taken to be, say. 


1 k = I dr I. e * ^~ (r ’ t) dt 

A 


op 


(454) 


b k (0) \ 


which are proportional to b^(0), with 


(455) 


A k = f A V ?) dr ' 

Each a^ therefore also corresponds to a pure coherent state so that Eq. 3 82 becomes 

p (v c k) = 62( °k/ A kA ) 

= i\i 2 62 (a kA\>- (456) 

We occasionally prefer to use a^, rather than = a^/A^, since it shows more clearly 
the distribution in a^. We also can derive from Eq. 379 


= f 4>*(r) dr J e ^ + op (r,t)dt, 


(457) 


and the resulting constant A^ would then reflect directly the relative energy intensities 
included in the observation volume or area A. 

In the presence of independent additive noise n(r, t) the field commutator (452) can 
be preserved by taking the noise to be classical. If the noise is stationary and Gaussian 
distributed, 


n(r, t) = 2 4> k (r) f R (t) 
k 


PkOk'C 4 ) =4expJ--^ 


rrn 


n 


(458) 


(459) 
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For independent the output distribution of (455) is modified to read 

. 2 ' 


= ^ exp | 


-iw, t 

P k" e ^k 


( 460 ) 


n 


Such a distribution would also arise when (3^ is a Gaussian random variable. The cor- 
responding distribution for {a, } is 


p k(v °k) = ^ e,£p 

TTn 


nA^ 

k 


(461) 


Again, of the type (454) or (457) can be formed, and the resulting p(a, a' ) worked out 
in a straightforward way. 

It is now clear that the radiative loss system in the absence of additive noise gives 
rise to pure quantum states — the coherent states of the electromagnetic field. This 
does not imply, however, that perfect performance can be readily achieved, because 
of the quantum nature of the received signal. 

6.3.2 Dissipative Channel 

We shall consider a simple stationary dissipative system with a Green's function 


G(rt;r't') = 2 <|> k (r) 4>*(r') lyt-t') 
k 


(462) 


for the field 


V ? ’ t)= l *k (?) b k (t) - 


(463) 


h k (t-t' ^ = exp j^ - T ^ t_t ' ^ ~ iw k* t-t ' ^ ' 


For concreteness, we may take, for example, 

(464) 

In this case the system is describable in a Markov manner, and, from Eq. 281, we have 
W~ r ’ th ^'’ V) t> f. (465) 


This commutator (465) can be immediately compared with (452). 

In this situation an additive quantum noise is needed to preserve (424). It is 
only necessary for that noise to have a commutator, so that (465) holds. The 
normal-ordered correlations of the additive noise can be taken to vanish, as, for 
example, when the system temperature goes to zero. Thus, if we write 
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fc> k (t) = exp(- yt-iu> k t) b k (0) + f k (t), (466) 

where b k ( 0) obeys (447)- (458), and 

<f£(t)f k (t)> = <f k (t)f k (t)> = 0, (467) 

we have 

<t> k (t)> = exp(- y t-iw k t) P k (468) 

<b£(t)b k (t)> = e' rt |p k | 2 . (469) 

The P-distribution of P k (t) is then 

P k(Pk (t >> P'I>>) = fi2 [Pk - exp (" T^k 1 ) P k ]‘ (470) 

For 


/ A dr / exp^iaj k t+ P t) + op (r, t) dt 


A k b k (0) ’ 


(471) 


the resulting distribution P(° k ' a k ) is identical to (456) if the integral in (471) can be 
physically implemented without additive noise. Such an integral implies amplification, 
and so will probably introduce an additive noise. 

When the system is at a finite temperature there will be additive noise associated 
with the dissipation. When according to our Markov assumption 


<f£(t)f k (t')> = r n6(t-t’) 


(472) 


<f k (t)f k (P)> = 0, 


(473) 


we have 


P k( P k’C t ) = ^ eXp j 


P k ~ exp (~ iM k t ~ T*) P h 


t 


(474) 


The resulting distribution for a k of the form (471) is the same as (461) even in the pres- 
ence of amplification noise, since further Gaussian additive noise can be accounted 
for by adding the corresponding noise power to n. Further discussion will be 
found in Appendix C. 
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6.3.3 Comparison 

Let us observe that the first difference between the radiative and dissipative loss 
channels lies in the corresponding field commutator (452) vs (465). In spite of this, the 
one-time distribution (470), while different from (455), is essentially the same as (456). 
In both cases the observables suffer losses, radiative for and dissipative for P^ft)- 
Moreover; note that we have pure coherent states in all of these situations. Similarly, 
the distributions (460) and (461) can be compared with (474). In the absence of ampli- 
fication noise the a^of (471) is identical to the a^ of (457). 

. ‘ In general, the dissipative case is more complicated. In the first place, amplifi- 
cation noise may be introduced. In the second place, observables like (457) are coupled 
for the dissipative channel. It should be clear, however, that when noiseless amplifi- 
cation like (471) is possible, the two situations can always be made identical with 
a different receiver. 

In summary, the two situations are basically identical for one-time measurements. 

In general they will be identical insofar as an integral of the (471) type can be imple- 
mented without additive noise. This is possible in principle classically, but we indi- 
cate in Appendix C that this is not possible quantum- mechanically if the integral is 
implemented by a linear filter. Except for this point, the quantum situation is com- 
pletely analogous to the classical one, with similar quantum- channel representations. 

6 . 4. Atmospheric Channel 

We now apply our previous consideration in section 6. 1 to the turbulent atmospheric 
50 78 

optical channel. ’ ■ Our first task is to establish the field commutator Eq. 321 from 

given classical specifications. While there are basic differential equation descriptions 

132 13 4 5 ^ 

for electromagnetic transmission through the atmosphere, ’ we shall not pursue 

such a detailed quantum development from first principles. Instead, we shall try to 
find the quantum model directly from our procedures for classical quantum correspon- 
dence. This would demonstrate the generality and convenience of our treatment. 
Warning should be given at the outset that our result is approximate, although it may 
be adequate for communication analysis. 

: In the usual model of a turbulent atmosphere, dissipative losses are neglected. When 
the turbulence is turned off, the field commutator is then clearly given by the free- space 
Green's function (453), where the set ^(r) is chosen depending on system geometry. 
When turbulence is introduced, one. may expect the mean Green's function to remain 
basically the free- space function. While this is not true in general, we shall show that 
'our. correspondence procedure supports this proposition as applied to many potential 
receiver configurations. . : 

We - first argue that the output field (442) is strictly Markov, i This follows in general 
from the time harmonic field that is usually assumed for the turbulent atmosphere, and 
also follows from the approximation that we shall make. The field commutator Eq. 321 
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is now given by the averaged Green's functions. 

We assume, as before, that the transmitted electric field is related to the output 
field by a log-normal multiplicative process 




O r ’ tK 


(475) 


where -y(r, t) is a complex Gaussian process and <^ 0 p(^>t) is the complex envelope of 

the electric field. The transmitted field (r, t) is then related to the source through 

the free-space propagation. The field i|j 0 p{r, t) is related to <o (r, t) through a linear 

deterministic filter in general. We further assume that- the. process y(r, t) is stationary 

in both space and time arguments.^ 0 The average Green' s function for 4 1 (r, t) is then 

op 

of the form 


G R (r, t;r't' ) = e v(r,t) G F (rt;r't') 


= C y X G F (?t;r't') 


(476) 


with the free-space Green's fimction G F (rt;r't') and a multiplicative constant C . 

As we have explained in section 6. 1, the G D (rt;r't') of (476) cannot be interpreted 

as the average random Green's fimction of a differential equation for all {r.-r'rt, t'} when 

C # 1. It is possible to have such an interpretation when the constant C is turned on 
^ _ 7 * 

only for t » t' and r » r' . We can therefore consider (476) as the G R (rt;r't') of (433). We 

still have to determine the region where C begins to be important. 

The commutator Eq. 321, which is now given by (476),- is used only in constructing. 

density-operator models. A^^l construction in this case involves integrating t and 

t' within the same coherent time interval and r and ?' within a diversity qf coherence 

areas. If the turbulence effect has not modified the field propagation significantly in.. 

a time interval t — t' and a space interval r - r' that are small compared -with the 

coherent time and the distance traveled in that time, respectively, it is reasonable • 

to approximate by unity in (476). We can then take Eq. 321 to be the free-space ' 

Green's function G Tr (rt;r't') for applications to density-operator calculations of many 

* A/ 

receiver configurations. For large t - t', the G R (rt;r't') will be given in our approxi- 
mation by (477). The precise behavior of G R (rt;r't') for all time has to be obtained by 
a more detailed classical analysis. • * 

The assumption that is actually -required for using G F (rt;r't') as G R (rt;r't') in our 
field commutator applications is that.the signal- processing time should be short enough 
so that in the scale of field propagation the turbulence effect is still not important in 
determining G R (rt;r't'). With the high velocity of light the corresponding space interval 
would certainly be large enough to include the signal-processing areas. This character- 
istic time at which turbulence starts to turn on has again to be obtained classically. We 
may nevertheless always use as a first approximation the original Green' s function of * 
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the transmission medium without perturbing effect as the G^(rt;r't') in the field com- 
mutator Eq. 321. 

It may frequently be convenient to employ the commutator between the electric or 
magnetic fields for receiver input modeling. This has the further advantage that the 
electromagnetic fields are more readily observable dynamical quantities. It is shown 
in Appendix F that in general we have 

[«?(?, t),<f(r,t)] = ifi Re G R (rt;r't'), (477) 

where G R (rt;r't') is the Green's function for 4j Q p(r, t) of the form (463). 

Let us now give the density-operator representation of the atmospheric channel for 
the following kind of receiver configuration. We assume that the free- space Green's 
function is expanded in the form 

G F (rt;r't') = Z j- <M?) <£(?') y fc (t) y*(f). (478) 

k 

Over a time interval T we assume 


J T y k (t)y*(t)dt= 5 kkI . 


(479) 


For convenience, let us employ a cylindrical coordinate system r = (z, p ) with the fol- 
lowing property for <fv. (z, p ). At certain points z we assume 

K. O 



Vv p) *k' ( v ) dp = 6 kk' v 


(480) 


over the coherence area A in the received plane at z . Here y is a constant smaller 

c ^ o 

than 1 when 4> 1 ,(i') is orthonormal over the spatial volume of interest. We now define 


\ £ k Ar y k (t) dt ^A 


6(z-z q ) 4> k (z Q , P ) ^ (r, t) dzdp, 


(481) 


so that by using (478) as our field commutator, 



(482) 


The variables 



(483) 


are properly normalized photon operators. 

Suppose that y(r, t) of (475) is completely correlated over the time interval and 
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spatial area where it is correlated at all, and is completely uncorrelated from one such 

7 8 

interval and area to another. Moreover, we assume that T is smaller than the coher- 
ent time interval. We can then write 


a k =za k +n k’ (484) 

where a^ is a parameter depending on the signal. The random variable z is 

z = ue^ (485) 

with independent distributions for u and 4>: 

(486) 

(487) 


P(u) = 


2 2 

(In u + tr ) 




exp < - 


u > 0 


<ru 


2 (T 


P(^) = 27T ’ 


2n > 4> > o. 


We assume that n^ is a Gaussian quantum noise that is uncorrelated to any order with z. 
Strictly speaking, is a Gaussian noise also multiplied by z so that, while still 
Gaussian; it may have some higher order correlations with z. These correlations we 
neglect here for simplicity. 

The variables a^ possess a joint P -distribution 


p<i) = n P k (\). 


(488) 


where for fixed z 


W - 


exp < - 


7rn, 


V 



and the {a^} are the associated classical amplitudes of {a^.}- We have assumed 


< n K> = \ 

< n k \>= o. 


(489) 


Averaging (489) over the distribution for z, we finally obtain 
2 


P k<V‘k ) = 


JTn, 


exp < - 





du I 




„ 2,2 .2 2 

(In u + v ) a, u 


V 2fi cr 


exp <- 


2cr 


n,. 


(490) 
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where I (x) is the zero-order modified Bessel function of the first kind. The 

..... t~ 

P -distribution is diagonal in the photon number representation of a^a^. If we are care- 
ful about the corresponding density -operator representation we can change variables 
back to {a^} for {a^,} by direct substitution of in (490). 

We have considered only one coherence area. The extension to a diversity of many 
coherence areas will be straightforward. Furthermore, we may see that many other 
density -operator representations can be formed, even with the many approximations 
that we have made. One most susceptible assumption is that within the signal -processing 
time T the turbulence effect represented by is not yet significant in G^(rt;r't'). 

It appears that a more detailed consideration of the problem at the classical level is 
required for a better quantum treatment. 

6. 5 Multiple Scattering Channel 

50 82 

We shall give only a brief consideration of scattering channels ’ that describe 
optical communication through clouds, fog, and haze. The first question for a quantum 
formulation is again the development of an appropriate field commutator at the receiver. 

Scattering channels are usually characterized classically by randomly varying space - 
time linear filters which are sample functions of Gaussian processes. They are 
analogous to ordinary fading dispersive channels with the added complication of spatial 
fading. The mean output field of the mean impulse response is again taken to vanish. 
Thus this problem falls into the general case that we treated in sections 6. 1 and 6. 2. 

The filter cannot therefore be interpreted as a random Green's function without mod- 
ification. 

Our argument in sections 6. 2 and 6.4 suggests that for t close to t' and r close 
to r' the average filter response G^(rt;r't') should not vanish, and can be taken to be 
the free -space Green's function for receiver input calculation. In the present form this 
is not a very good application, particularly for an earth -to -space optical link. In the 
absence of a detailed consideration one may take the free-space Green's function for 
the field commutator as a first approximation. 

Once the field commutator is known, it is straightforward to obtain density -operator 

representation for different receiver configurations. Since our received field is 

Gaussian, the calculation is further simplified because the signal -carrying processes 

and the pure-noise processes are independent. With a Karhunen-Lofeve expansion of 

G-rJrt; r '(t ' ) , or an expansion of the type of Eq. 84, the problem is reduced to a 
ri 

canonical diversity representation where each diversity path is a Gaussian multi- 
plicative or Rayleigh fading channel. For brevity, we are omitting the obvious pro- 
cedures for carrying out such an analysis. 

If the phase information has been already completely destroyed at the receiver, pre- 
sumably a direct-energy measurement would be made. In such a case it might appear 
that the field -amplitude commutator is not needed. That this is not so is clear if we 
note that we require the field commutator for calculation of the receiver input . 
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density operators, which in turn are needed for calculations of photon measure- 
ment probabilities. 

6. 6 Guided Optical Transmission 

We shall give a first-principle quantum description of an optical transmission line 
considered as a communication channel. The purpose of such an analysis is to show 
the relation of our general theory to concrete situations by an example in which things 
can be worked out in detail, and to gain further confidence in our general results. 
Our pace here will be rather rapid, omitting many detailed derivations. 

Let the voltage and current along a one -dimensional TEM wave transmission line 
be V(z, t) and I(z, t) which obey the dissipative equations 


9V _ _ T 31 

9z L at 


li = _c lr ■ GV + I 0 (t) 6(z) + F(z> t} 


(491) 


for a source current I Q (t) and noise F(z,t). Introduce the potential A(z,t) so that 


t , - _ 3A 

K z ,t) - ~ gj 


V(z, t) = L 


9A 

at 


(492) 


We have from (491) 


9 2 A 1 a 2 A 9A 

8z 2 " s 2 at 2 ‘ 9t = 


-I Q (t) 6(z) - F(z,t) 


(493) 


2 _ 1 
S “ LC 


(494) 


The conjugate field for A(z, t) is 


. _ L 3 A 

7 r(z, t) - 2 3t ’ 
s 


(495) 


and from canonical quantization 

[A(z, t), 7T(z ', t)] = iK 6(z-z '). 

Introduce the initial and boundary conditions 
ai(z, t)| 


l(z, o ) = 


at 


= 0 


t=0 


(496) 


(497) 
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I(o, t) = 1(1, t) = 0, 


(498) 


where the length H of the line can become infinite. In general we shall only look at the 
signal field before it reaches the end of the line so that a finite 1 is just a mathe- 
matical convenience. 

We can now expand A(z, t) in the standing-wave modes 


A(z, t) 



£ cos k zq (t), 
n M n 
n 


with 


(499) 


= k n s ' k n = T : integer n 


and [q n (t), q n ( t)] = ifi. If we expand 


1/2 


P(z.t) = (^p) 2 co n cos k n z{f n (t)+fj(t)} 


(500) 


72 73 

and adopt a Markov -rotating -wave approximation ’ we have, corresponding to 
Eq. 216, 

db 


"df = -iu) n b n " 7 * b n + f n (t) + ( IcE^ 


n 




(501) 


where 




[»„<*>• b l {t \ 


= 1 


G 

v = C 




(t) 


= v6(t-T) 6 . 

1 ' ' nn 1 


< f n (t)f n' (T) > = Vn6(t-T) 6^, 


< f n ( t > f n -< T » =0 


(502) 

(503) 

(504) 

(505) 

(506) 

(507) 


and n is a Bose distribution as before. When f n (t) is taken to be Gaussian, specification 
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of the relevant fields is complete. The Markov -rotating -wave approximation is valid 
when y/^ n is negligibly small for the frequency modes of interest. A fundamental deri- 
vation of (501) can be given by considering a coupled system and reservoir. 
Introducing the field (Eq. 193) 

ijj (z,t) = Z cos k zb (t), (508) 

T op n n ' ' 


we have (Eq. 321) 

4j (z, t), k)j^ (z t') 
mp op 

which can be compared with (46 2) -(46 5). It is clear that a P -distribution for each b n (t) 
can be written with the same form as (474). 

The Green's function of our differential equation (493) under a source -6(z) 6(t) and 
our boundary condition in general is 

V 

(t— t ') , 

G(zt;z’t') = e 2 u _i (t -t' - ^~) I Q 

which is of course space -time invariant. The function U_ 1 (x) denotes the unit step func- 
tion, and I q is the modified Bessel function of zero order. In our approximation we take 



Z cos k z cos 
n 


k n z' exp j - - (t-f) -i« n (t-t')| (509) 


G(zt; z 't ' ) = 


e 


V 

2 " ( t-t ') 


u _l(t-t'- 


*?) 


(511) 


so that under a source — (t ) 6(z), the output current is 


Kz-t) = exp{~ }l g (t-f). 


(512) 


Other output fields including the noise can be readily obtained from (511). Such explicit 
construction of the detailed physical space -time behavior for the fields is clearly useful. 

To demonstrate the usefulness of such an explicit representation, let us suppose that 
the signal I g (t) is turned on for a duration T only. The mean current (I(z,t)) is then 
nonvanishing only in the space interval st to s(t-T) at any moment t. The mean voltage 
is similarly nonvanishing only in such an interval. We can construct a density -operator 
representation for linear functionals of 

4<(z,t) = CI(z,t) + iV(z,t) (513) 


or other similar fields like A(z,t). Here C is a real constant. For a choice 
of Eq. 379 
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(514) 


a = ^s(t-T) ^ z - t) yn( t -f) ex p(^) dz 

the commutator 
[a, a^] 

can be determined from 

[V(z',t),I(z,t)] = ifisV(z-z'). (515) 

When 

V‘>“yn(‘-fK (516 > 

/ yj(t) dt = 1 (517) 

the operator a as obtained from (514) contains I directly as its mean. A single den- 

‘ *1 ® 

sity operator can then be developed for (a, a ' ) without loss of optimality when the addi- 
tive noise is spatially white, which is the case for our frequency band of interest. This 
density operator can be compared with 

p = -^n p k (p k .pi.t) 

constructed from each of the mode amplitudes b k (t). The resulting simplification is 
indeed enormous, especially for a parameter estimation problem when we want to 
estimate I g . 

It is clear that the statistical dynamical problem here is completely solved. We have 
both the detailed Heisenberg operator solutions and the relevant statistics derived from 
what we may call a first -principle calculation. Again, many density operators can be 
formed, and since this is a straightforward exercise we shall not dwell on the pro- 
cedure. 

6. 7 Conclusion 

We have given several explicit examples, together with a general consideration, in 
our procedure for obtaining quantum -channel representation from given classical spe- 
cification. Our purpose has been only to indicate the convenience and generality of our 
method, rather than to present an exhaustive treatment of the individual optical channels. 
From the description that we have given, we can construct the density-operator repre- 
sentation for any convenient receiver configuration. 

An important point in our discussion is that the unperturbed Green's function may 
be used quite generally as a first approximation in the field commutator employed for 
receiver input calculations. When this commutator is known our classical quantum cor- 
respondence is completed. The extent to which this use of unperturbed Green's function 
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will be a good approximation is still unknown. It is clear that for both the atmospheric 
and scattering channels it cannot be held unconditionally. It appears that further detailed 
classical analyses, particularly those from a differential equation viewpoint, will be 
required to give more accurate quantitative determination of the field commutators or 
the system Green's functions. 
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G. CONCLUSION TO PART I 


We shall give a final synopsis of the major points in our quantum communication 
system modeling and some discussion of the nature of our approach. Suggestions for 
more useful work in this area will also be indicated. 

7. 1 Summary of Results 

In Part I we have developed the general quantum representation of channels that are 
describable by linear equations. For this purpose our theory provides a general frame- 
work for obtaining the specifying parameters in the quantum description by various 
means. A most convenient way of achieving the quantum specification is through the given 
classical specification. In such a situation we have to obtain the channel output field 
commutator from the classical information. Receiver input density operators can then 
be calculated directly from a classical statistical specification of the output field. Our 
field commutator specification is limited, however, to given classical Markov or sta- 
tionary systems, so that we assume that the corresponding quantum system is also 
Markov or stationary. 

Our final construction of the P- representation given by Theorem 13 is quite simple. 
In particular, if, in the absence of noise, our transmitter generates a coherent state 
at the channel output, then, in the presence of noise and other channel- signal statistics, 
the channel output is a classical superposition of coherent states. 

It may be argued that this construction and interpretation are obvious without our 
analysis. Our theory, however, illuminates the assumptions that are inherent in such 
a procedure, including the special form of quantum statistics that we take for the fields. 
More important is the essential point that in this procedure the field commutator has 
to be known for a proper construction of density operators. Thus, an application to an 
arbitrary set of variables a in the procedure will lead to incorrect results. It should 
be clear that such a procedure will usually have no meaning unless the field commutator 
is derived from the classical information. 

We have given some examples that are pertinent to optical channels to illustrate 
applications of our procedure. The important lesson to learn from these applications 
is that the classical information is not always directly given in a suitable form for 
transition to the quantum region. Various classical analyses may be needed to put the 
classical information in a correct form. In this connection it has been noted that 
descriptions of classical communication systems from a physical differential equation 
viewpoint will be more convenient for quantization. 

We have presented primarily a framework in which linear quantum channel repre- 
sentation can be developed, particularly from given classical specification. We have 
also considered some matters of independent interest, for example, a theory of quan- 
tum random processes and fluctuation- amplification theorems. Also, our theory of 
quantum field propagation can be immediately adopted as a theory of quantum noise in 
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traveling-wave amplifiers. There are several problems, however, that need further 
study for a complete theory of communication system modeling. 

7.2 Suggestions for Further Research 

The most important unsolved problem is the proper development, from classical 
information only, of the field commutator that is applicable in a general situation. This 
can be viewed as a problem of canonically quantizing a nonconservative stochastic sys- 
tem. A brief discussion of possible approaches to this problem has been given before 
and also in Appendix C. 

Another outstanding problem is the development of convenient density operators or 
measurement probabilities for any receiver configuration and observable. Whether and, 
if so, how this can be done is uncertain. 

The transmitter that we have assumed generates only coherent states or their super- 
positions. This can be shown to be necessary if the channel output field is going to relate 
linearly to the input excitations. Moreover, one may want to generate other states at 
the expense of allowing nonlinearity in the system. Given a channel structure, it may 
be possible to formulate this problem in a manner analogous to our development. In 
general, analysis will be complicated by nonlinearity. This problem is interesting 
enough to deserve much attention. 

We have not considered the problem of developing a physical implementation of a 
given quantum measurement, except for a brief theoretical discussion in Appendix E. 

This problem is somewhat remote from channel modeling; nevertheless, it is an impor- 
tant matter that is closely connected with a more physical description of communication 
systems. 

Some other generalizations of our theory are discussed in Appendix D. Despite its 
apparent generality, we conclude that our theory leaves many fruitful areas that are 
open for further investigation. 
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Part II. Optimization of Communication System Performance 

A. INTRODUCTION AND DETECTION THEORY FORMULATION 

In Part II we shall take up the problem of optimizing quantum communication system 
performance under various performance critieria. We shall concentrate on M-ary digi- 
tal signal detection and only briefly consider other areas. Our main results are some 
necessary and sufficient conditions on general optimal receiver specification in quan- 
tum detection theory. We have not yet seriously exploited the applications of these con- 
ditions. 

1. 1 Relation to Previous Work 

In classical communication theory the general mathematical specification of 

receivers is an important conceptual problem whose solution is well-known. The 

conceptual problem also arises in quantum communication theory, but the general 

solution is still to be found. For digital quantum detection the optimal receiver 

41 43 44 

specification is known only in very special cases, ’ ’ and no general conditions that 

the optimal detector must satisfy have been given. The minimum mean-square-error 

46 

quantum estimate (MMSEQ) of a single random parameter has also been worked out 

47 42 

and bounds of the Cram6r-Rao have been given for both random and nonrandom 
parameter estimations. The measurement observables in these works are restricted, 
however, to self-adjoint operators. The general MMSEQ in the multiple -parameter case 
is still unknown, as is the maximum-likelihood quantum (MLQ) estimate. The Wiener- 
Kalman type of continuous filtering also has no quantum analogy at present. 

We shall examine these general specification problems, and give some general con- 
ditions on the optimal digital detector, together with some examples illustrating our 
results. We shall extend some of the previous work on estimation and analog com- 
munication. A final summary for Part II will be included with suggestions for treat- 
ment of other optimization problems. First, we shall give several careful formulations 
of the detection problem. 

1. 2 Background 

In establishing the results of Part II we have to pursue mathematical rigor, in con- 
trast to Part I where the precise conditions of validity are relatively unimportant. We 
shall employ some general optimization theories that are applicable to abstract normed 
linear spaces. We shall also need certain properties of various spaces of operators 
corresponding to our quantities of interest. The mathematical theories of these sub- 
jects are of relatively recent origin and their detailed exposition may be found in 
the references; some will be briefly discussed in Appendix H and Appendix I. 
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1 . 3 Original Formulation of the Detection Problem 

A detailed treatment of quantum channels and the corresponding receiver input den- 
sity operator representations has been given in Part I. These receiver input density 
operators are the basic given quantities in the formulation of our detection problems. 

For present purposes, we model our communication system as in Fig. 6. The S^(t) which 



Fig. 6. Simplified representation of quantum communication systems. 

represents the message information on the signal is an ordinary time function, and the 
dependence of on j is hidden in some parameters in the expressions of the p^ Our 
p's are the analogs of conditional probabilities in the classical case. If the message 
ensemble is continuous, we still have the same kind of Pj representation, the only dif- 
ference being that now j runs through a continuous set. With this description we can 
begin to formulate the detection problems. 

Let 3C be a separable Hilbert space over the complex field $ whose elements are 
the quantum states | ) on which our p's are defined. Suppose that we have an M-ary 
equiprobable message alphabet {j= 1,. . . ,M}with the corresponding channel output for 
message j described by the density operator p^.. Each p^ is therefore a self-adjoint 
positive semidefinite operator of unit trace on 5C. [See Appendix H for a brief resume 
of some basic mathematical definitions and facts that we shall use.] 

Suppose that we make a quantum measurement of the observable X on the receiver 
input. We take the class of measurable operators to be those whose eigenvectors form 
a complete orthonormal or overcomplete set in 3C. Thus the measurable operators are 
the observables that are defined in Appendix A. As discussed in Appendix G, not all 
such observables have been explicitly shown to be measurable, in the sense that the 
eigenvectors of the operators are used to compute measurement probabilities and the 
eigenvalues are the measured parameters. We can therefore make the qualification that 
X should indeed be measurable in the following formulation. This qualification is not 
relevant to our work since we feel that such observables can ultimately be shown to be 
measurable, and in any case we shall not deal with this formulation in its original form. 
The difficulty arises only if the conjugate Hermitian components of X possess a q-number 
commutator. 

th 

The probability that an eigenvalue x of X is measured, given that the j 
message is sent, is then „ . 


p(x |m.) = (n ( Pj |x), 
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where lx) is the eigenvector of X. Let us adopt a possible random strategy with 7r(x) 

th 

being the probability that we decide that the j message was sent, given the measured 
value x. The total probability of correct decision is then 


P f C l = M ^ ^ < x l p j ! x > dx ' 


(518) 


We use the integral here merely as a symbol, in that it represents a sum in the discrete 

case and an integral over the relevant variables in the continuous case. 

In our detection problem we wish to maximize (518) for given p., subject to the fol- 

, J 

lowing constraints. First, the eigenvectors x), which we use to imply either com- 
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plete orthonormal or overcomplete must, be complete. Thus 


/ lx) <x| dx = I, 


(519) 


where I is the identity operator on 3C. The decision function n:(x) obeys 
7L(x)5=0 


(520) 


Z 7T (x) = 1. 

j J 


(521) 


Then the problem is to maximize (518), subject to (519)-(521). To be precise we have 
also to add the constraint that the resulting X be measurable. 

This formulation of the detection problem, which we call 0, is the most accurate 
and general one. It is very difficult to handle, however, because of the dependence of 

tj-.(x) on the parameter x which is still unknown. Therefore it is necessary to trans- 

J 41 

form it to a more convenient form. Helstrom first gave an operator formulation of 

the problem in which he considered only orthonormal sets {|x)}. Particular caution 

should be used when including overcomplete sets. We shall develop several formulations 

of the detection problem for orthonormal sets {|x)} and for general complete. sets. 

1. 4 Operator Formulation of the Detection Problem 

4 1 

Let us introduce the detection operators 


7T = / 7T (x) | x ) <x | dx, 

J J 


(522) 


where we have let all 7T be simultaneously diagonal in a complete set {|x)} so that 
the decision can be made by measuring X. Then condition (521) is equivalent through 
(519) to the operator constraint 


Z 7T . = I , 

J J 

and (518) can be written in operator form 


( 523 ) 
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(524) 


P[C] = m = tr. » jPj . 

The condition (520) has to be left in its original form in general, since { | x )} may be 
overcomplete. The constraints on the 7T restrict them to be self-adjoint positive semi- 
definite bounded operators. If (|x)} happens to be an orthonormal set, then (520) follows 
from positive semidefiniteness. In principle, this formulation also includes overcom- 
plete sets. 

We then have to maximize (524) by choosing {n^}, subject to (520), (522), and (523). 
This problem, which we call I, appears to be little more than a rewriting of our orig- 
inal formulation 0. There is significant difference, however, in the quantities chosen 
for optimization. With this formulation we have transformed Problem 0 to an operator 
optimization Problem I. This problem is untractable because the constraint (522) that 
the 7T j be simultaneously expressible in diagonal form in terms of the same complete 
set is hard to handle. It makes the domain of optimization nonconvex and it cannot be 
expressed as an explicit equality constraint. We must therefore consider some variants 
of the problem. 


1. 5 Broader Operator Formulation of the Detection Problem 


A more general problem, which we call II, can be set up by dropping^the difficult 
constraint (522). Thus Problem II is: Given {p^}, maximize (524) by choosing positive 

semidefinite self-adjoint bounded operators {ir}, subject to (523). 

The solution set of ir j of Problem II is not guaranteed to be simultaneously express- 
ible in diagonal form in the same set of vectors. Even if they are simultaneously diag- 
onal in an overcomplete representation, the fl\(x) are not necessarily positive for all x. 
Furthermore, their simultaneously diagonal representation, if it exists, may not be 
measurable. Nevertheless, this formulation is useful because it permits exact solu- 
tions and may yield a usable set for the original problem. Its solution will yield at least 
an upper bound on the probability of correct decision given by (524). 


1. 6 Operator Formulation Allowing Only Self-Adjoint Observables 

When we are restricted to measurements of self-adjoint operators, the usual observ- 
ables that are referred to in quantum theory, the original formulation can be greatly 
simplified. We first observe that the use of a nonrandom strategy for any complete set 
{|x)}is generally optimum. In fact, once the set { | x )} is given, we can only do worse 
by using a random strategy, just as in the classical case. Thus without loss of optimal- 
ity we can take 

rr i (x)n.(x)= 0, i * j. (525) 

For Problems I and II the application of (525) does not lead to simplifcation of 
the constraints, because of the possibility of overcomplete {|x}}. When { | x )} is 
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orthonormal, from (522) and (525), we have 


7T.7T . = 0 ¥ i * j. 

i J 


( 526 ) 


Lemma 1 


The constraints (526) are equivalent through (523) to the smaller set of conditions 


7T. = 7T.. 
l l 


(527) 


Proof: Multiplying both sides of (523) by jt., we have (527) immediately. Given (523) 

* 13 5 

and (527), conditions (526) follow from the theorem, which states that a finite sum 
of projection operators is a projection operator if and only if the operators are pairwise 
orthogonal. 

Note that (526) implies, in particular, that 


[tt.tt] = 0, ¥ (i, j). 


(528) 


While an arbitrary set of commuting self-adjoint operators may not possess a complete 
orthonormal set of simultaneous eigenvectors, our {7r} do have such a simultaneous set. 
Since this point is of some importance we state the following lemma. 

Lemma 2 

Our detection operators obeying (523) and (527) possess many complete orthonormal 
sets of simultaneous eigenvectors. 


Proof : Such operators {n^} are orthogonal projection operators so that their ranges 

are orthogonal subspaces of 3C. Within each of these subspaces any complete ortho- 
normal set can be formed which automatically has eigenvectors of all of the {7r}. 
Adjoining all such sets, we have a complete orthonormal set of simultaneous eigenvec- 
tors for the {?r}. Different choices of eigenvector subsets in each subspace give rise 
to different sets of simultaneous eigenvectors. 

By restricting ourselves to self-adjoint operators, our Problems 0 or I are trans- 
formed to the problem, which we call III, of maximizing (524), subject to (523) and 
(526) or (527). Note that the {tt} are automatically positive semidefinite under these 
constraints. Furthermore, the solution set of this problem is guaranteed by Lemma 2 
to be simultaneously diagonal in many complete orthonormal sets. The general quan- 
tum measurement of a self-adjoint observable possessing a complete set of eigenvectors 
is explicitly shown in Appendix G to be possible in principle. The many complete ortho- 
normal simultaneous eigenvector sets are all equivalent in detection error per- 
formance. 
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1 . 7 Conclusion 


We have given three different operator formulations of the detection problem. These 
formulations of digital error minimization with equiprobable messages are actually as 
general as an arbitrary quantum M-ary decision problem. For the Bayes or the Neyman- 
Pearson criteria we are led to minimize the average cost 

MM 

C= Z z P-C., / <n| p | x ) TT (x) dx 
i=l j=l J 1J J 1 , 


= Z / <x| p! |n) tt.(x) dx 


(529) 


with 


p ! = Z p.C. p .. 
i j iJ J 


Thus our previous formulations remain the same except for substitution of the p? for 
pj. The pj are also positive semidefinite self-adjoint operators of the trace class. 


B. OPTIMAL DETECTOR SPECIFICATION AND EXAMPLES 

We shall derive some necessary and sufficient conditions on the optimizing set 
for the detection problems II and III formulated in Part II-A. Some simple examples 
illustrating the usefulness of our results will also be given. A brief description of some 
optimization methods that we employ is given in Appendix I, and certain mathematical 
definitions and properties are listed in Appendix H. There will be no discussion of back- 
ground material here. 

2. 1 Conditions on Optimal Detectors of Problem II 


We start with Problem II where, for given {Pj}. we want to maximize 


Z tr. tr p 
j 33 


(530) 


subject to the constraint 


Z 7r. = 1, 

j J 


(531) 


by choosing the positive semidefinite self-adjoint bounded operators on 3C. We shall 
derive our results for Problem II from its dual Problem lid. For this purpose, we con- 
sider the Banach space of trace-class operators t C 88 , where SB is the normed lin- 
ear space of all bounded linear operators on 3C. Let S C r be the normed linear space 
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of self-adjoint finite trace operators over TR . Let P be the positive cone of positive 

137138 

semidefinite operators in S which defines the partial order ’ It is obvious that 

P is indeed a closed convex cone in S. [See Appendix H for definitions of these 
terms.] 

We suggest that the dual problem of II, which we call lid, is 


min tr. \, 

ies 


(532) 


subject to 


X 5* Pj J = 1 M 


(533) 


for given {pj}- We have to first establish a few points before we can proceed. The fol- 
lowing lemma will be used frequently. 

Lemma 3 


Let x^.x,, be two positive semidefinite self-adjoint operators on an arbitrary 
Hilbert space. Then 
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tr. XjX 2 ^ 0 


and (534) 

tr. XjX 2 = 0 

if and only if 

x x = *2 X \ = (535) 

Proof : We need to show first that tr. x j x 2 is rea T This follows from 

£ t ft 

(tr. x^) = tr. (XjX^' = tr. x 'x^ = tr. x^ = tr. x^. , 

Every positive semidefinite self-adjoint operator admits a unique positive semi- 

138 2 2 

definite square root such that Xj = a . Also let x 2 = b so that 

tr. Xj x 2 = tr - a 2 b 2 = tr. (ab)^ (ab) $ 0. (536) 

It follows also from (536) that tr. x x S 0 if and only if ab = 0 so that the lemma follows. 

>!« * ^ 

Consider the dual space S of S. The elements of S can all be represented as 

tr. 7tx (537) 

136 139 

for x e S and 7 r £ V, where V is the space of self-adjoint bounded operators. 

Also, it is clear that for each such defines a bounded linear functional on S. This 

* 

representation of elements in S is crucial for concrete application to our problem of 

Theorem I. 1 in Appendix I. The conjugate cone P C S corresponds to positive semi- 

’ £ £ 

definite self-adjoint bounded operators; that is, x G P can be represented as (537) 
with n also positive semidefinite. We now want first to establish existence for Prob- 
lem lid, which is more important to us than uniqueness. 


Lemma 4 


A solution to Problem lid exists and is unique. 


13 6 

Proof : Consider the larger Hilbert space of Hilbert -Schmidt operators Z, 

3S D Z D r. Problem lid can be formulated as a minimum norm problem on Z. The 
constraints define the domain of optimization as a closed convex set in Z. Let D be 
the set of positive semidefinite self-adjoint operators in Z which satisfy (533). D is 
obviously convex. Since each set {\|\5 Pj} is closed for every j, D is also closed. Thus 
Theorem I. 2 in Appendix I can be applied to yield existence and uniqueness for 
\ G D C Z. If we write V = we see that the minimum is certainly finite. 

j J 
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Therefore X is of finite trace and so is in S. 

After these preliminaries we may now state a theorem. 

Theorem 14 

There exists a set {^j} which can be used to solve Problem II. The necessary and 
sufficient conditions for this optimizing set, in addition to the constraints, are 

7T j (X-p j )=0 Vj (538) 

for an\£ S such that , 

X £ P • Wj. (539) 

Proof : We apply Theorem I. 1 directly to Problem lid. Our f is a linear functional 

tr. X defined on S, and the constraint mappings are also linear. It is clear that all con- 
ditions of the theorem are satisfied. Thus we have 

min tr. X = max min {tr. X + Z tr. 7T • (p • — X. )}, (540) 

XS=p. 7T.50 X£S j J J 

by using the linear functional representation (537) on S so that vr EE V. Here 7r S 0 is 
also defined with respect to the cone of positive semidefinite operators in V. The right- 
hand side of (540) can be converted to our Problem II. 

min tr. X= max tr. Zwp.. (541) 

X^ Pj tt.^ 0 j J J 

Z7T-1 
J J 

The existence of Problem II is therefore given by Theorem I. 1 
the constraint (Eq. 527), we have 

Z tr. (X-p.)7T. = 0. 
j J J 

By Lemma 2, (542) immediately gives 

(X " P J )7r j = 7r j (X_P J ) = ° (543) 

with X > pj, j. By Lemma 3 such a X exists, and the necessity part of the theo- 
rem follows. To show sufficiency, we note that in general 

tr. X ^ tr. Z7T.p. . (544) 

j J J 

which follows from (531) and (533). Thus the set {77\} which satisfies (542) achieves a 


Furthermore, using 
(542) 
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maximum for (530), and sufficiency is demonstrated. 

Since we know that the solution of Problem II is not unique, Theorem 14 provides 
a rather complete characterization of Problem II. We have also the dual problem of 
choosing one variable, subject to M constraints, whose solution would provide valuable 
hints on the solution to Problem II. This dual problem may appear to be easier to handle 
than a unique one. Of course, we can always attempt to solve the system . 


7r(X- Pj )=0 tyj (545) 

X * Pj V J (546) 

Z 77. = 1 (547) 

j J 

iL * 0 V j (548) 


which is quite difficult in general, and may not be useful for our original problem. 
(Recall the discussion on the nature of Problem II in Section II-A. ) 

Several interesting properties follow directly from (545)-(548). First, we note that 
X, by summation over j on (543), is 

X = Z p.7 r. = Z 7T p (549) 

j J J j J J 

This equation is already a condition on the solution set With equation (549) the sys 

tem of operator equations and inequalities (545)-(548) is also transformed to a system 
with variables {7 l} and {pj}only. Besides application to Problem III, which will be dis- 
cussed, Theorem 14 yields immediately the following corollary. 

COROLLARY. Suppose that we have found X£ S and a complete set {|x)} such that 
(546) is satisfied and 

<x | X |x> = max <x | p |x), (550) 

j J 

then the original Problem 0 is solved by measurement of { |x)}, together with a non- 
random strategy. 

Proof : Given { |x)} and X, let us expand 

= / » j( x) |x> <x| dx. (551 ) 

Consider 

tr. (X-Pj)JTj = / 7 J\(x){ ( x | X |x) - (x | P j jx)} dx. (552) 
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Employing nonrandom strategy such that 7Y(x) 5 0, 


)g( j and Z 7T (x ) = 1, we can set 

j J • 


(552) to be zero for every j, which is then equivalent to (551) by Lemma 3. Theorem 14 
then insures that (551) provides a solution to Problem II and so to Problem 0, since 
(531 )-(535) are satisfied. 


2. 2 Conditions on Optimal Detectors of Problem III 

We now consider the problem, which we call III, of maximizing (530) by choosing 

orthogonal projection operators {77^}, subject to (53 1 ). The constraint (Eq. 526 or 

Eq. 527) makes our problem nonconvex, so that it is difficult to establish existence or 
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global sufficient conditions by use of Kuhn- Tucker theorems. We have then to apply 
local conditions by taking derivatives. The following theorem is proved in Appendix I. 

Theorem 15 

A necessary condition for {7r} to solve Problem III is Eq. 549: 

X 7T . p . = Z p .7 r .. 
j 3 1 J 

A sufficient condition is given by the following theorem. 

Theorem 1 6 

A sufficient condition for {ff} to solve Problem III, apart from constraints, is 


2 7T p = Z p 7T . 

j JJ j J J 


2 7T.p. ^ p . 
i 11 J 


V J- 


(553) 

(554) 


The solution so found will also solve our original problem. 

Proof : For a set {tt} satisfying (553) and (554), we see that with 

x Yj • f Vi 

the necessary and sufficient conditions of Theorem 14 are satisfied if we apply 
Eq. 526. 

There are actually some more restrictive necessary conditions than Theo- 
rem 15. We do not list them here because, at present, they are in a more 
complicated form, and we think that in a final analysis the sufficient conditions 
of Theorem 16 are also necessary. It is then important and interesting to estab- 
lish existence for Problem III explicitly. We now turn our attention to some 
simple applications of our results. 
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2. 3 Simple Detection Examples 


4 1 44 

A known solution of Problem I has been found previously ’ for the special 


case in which the density operators pairwise commute. That is, 


[ p r p j’ p k“ p ^ = °- *Mi, j,k, £). 


(555) 


Optimal detector specification and evaluations have been limited to such particular sets 
of given p, In this case the observable to be measured has eigenvectors that form the 
simultaneous diagonal representation of - p^. It is straightforward to show that the 
detection operators n . so constructed satisfy the sufficient conditions of Theorem 16 


J 

when the p^ satisfy (555). 
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To demonstrate the usefulness of our result, we have then 

to consider a given set {p .} which does not obey (555). Some such simple sets {p,}will 
53 J J 

now be discussed. 

When the ranges of p^ span a finite dimensional space only, the operator system 
(545)-(548) reduces to one for finite matrices.. Let us consider a particular case in 
which we are given M pure states p . 


Pj = |j> <J 


3 = 1 M, 


(556) 


where the vectors |j) are linear-independent. The projection operators it. can be 
chosen to have one -dimensional ranges 


7L = iPjXPjl j= !>•••> M. ' (557) 

Applying the necessary condition (Eq. 549) in the |fL) -representation, we obtain imme- 
diately the equations 

<P m |m> <m|p n >= <p m |n) < n |P n ), m, n = 1 , . . . , M (558) 

which have to be solved, together with 

< i |j> = 2 <i|P n > <P n |j>. i.j=l M - (559) 

Note that the system (,558)-(559) implies M(M+1) equations with the same number of 
unknowns. The solution should therefore be optimal in general if an optimum solu- 
tion exists at all. It is difficult to check the sufficient conditions in general, but 

we believe that they are automatically obeyed in this case. Note that the necessary 

50 

condition (558) has also been derived before in a different manner. 

Consider a special case in which the given |j) { j | obey 


< i I J > = V 


i * 3 


(560) 


for a real constant y independent of i and j. A detection basis where the optimal 7T 
are constructed can be formed from |p^) which satisfy 


<i|Pi> = a 
< J |P ± ) = b 


i * ], 


(561) 

(562) 


where a and b are real constants independent of i and j. [This particular structure 
was suggested to the author by Dr. Jane W. S. Liu.] These constants are solutions of 

a 2 + (M-l)b 2 = 1 

(563) 

2ab +'(M-2)b 2 = y. 

It can be checked that (558) is obeyed for such it . = Ip.) (p.l. The sufficient conditions 
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(554) for this case have also been shown to be satisfied. Generalization of this 
example to the case of complex y, a, and b is straightforward. 

It can be seen from these examples that our theorems have at least the virtue of * 
enabling verification of conjectured detection operators. The sufficient conditions (554), 
however, are usually hard to check, especially when the problem does not possess some 
kind of symmetry. Further work in the simplification or reduction of the sufficient con- 
ditions is indeed warranted. 


2. 4 Conclusion 

We have given some necessary and sufficient conditions for the optimal detection 
operators of Problems II and III. They do not completely characterize the optimal 
detectors of our problem. It should be possible by the same kinds of techniques 
or simple extensions of them to generalize considerably these results to yield a more 
complete solution for the original problem. 

Nevertheless, it is meaningful to ask for solutions of our system of operator equa- 
tions and inequalities specifying the optimum detector. Although methods for dealing 
with such systems regarding both existence of solution and procedure of solution, do 
exist, 4 4 * ’ ^ they do not appear to be directly applicable to our situation. It 

should be fruitful to develop an efficient procedure, for the solutions of such sets for both 
general and specific {Pj}- When 3C is finite -dimensional the conditions above would be 
only on finite matrices where additional methods are available. A numerical solution 
of the optimizing conditions would not be useful for receiver implementation, at present, 
although it would provide a bound on error performance. ' , 

The examples that we have considered all pertain to Theorem 16. They suggest, 

(554) 

together with our general results, that the sufficient conditions should follow from 

the necessary condition, the constraints, and the obvious optimal choice. A proof 
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of this important and convenient result has not yet been given. Further applications of 
our theorems should be exploited, both for the determination of general optimal detector 
properties and for performance evaluations of realistic channels. 
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C. OTHER PERFORMANCE OPTIMIZATION PROBLEMS 

We shall now treat briefly some quantum optimization problems in other areas of 
communication theory. This will include mainly certain considerations of estimation 
and channel capacity. Our pace will be quite rapid, indicating just final results. The 
derivations are often straightforward. A summary of Part II will be given at the end of 
this section. 

3. 1 Estimation of Random Parameters 

The general optimal self-adjoint operator for estimation of a single random param- 

46 

eter was first worked out by Personick. The following theorem can be applied to yield 
the corresponding optimal operator without self-adjoint restriction. 

DEFINITION. A pseudo -Hilbert space is a linear vector space X, together with a 
pseudo inner product defined on the product space XXX. Corresponding to each pair 
of vectors x, y in X the pseudo inner product (x,y) of x and y is a scalar, taken to be 
a real number. The pseudo inner product satisfies the following axioms: 

1- (x,y)=(y,x) 

2. (x+y, z) = (x, z) + (y, z) 

3- (Xx,y) = \(x,y), X G TR 

4. (x, x) 2 0. 

The corresponding pseudonorm will also be denoted by double vertical bars. The only 
difference between our pseudo inner product and an ordinary inner product is that 
(x, x) = 0 does not imply x = 0 in our case. Our pseudo-Hilbert space is then a pre- 
Hilbert space whose inner product is a pseudo product. The following theorem is 
a straightforward generalization of the ordinary projection theorem. Proof of 
this theorem will be omitted. 

Theorem 17 ' 

Let X be a pseudo-Hilbert space, M a subspace of X, and x an arbitrary vector 
in X. A necessary and sufficient condition for m Q G M to minimize || x— M || , m G M, is 
that the error vector x - m Q be orthogonal to M. That is, 

(x-m , m) =0 ■ V m. 

Suppose that in the estimation of a single real random parameter we choose to 
measure X, whose eigenvalue x we take to be the estimate of a. The mean- 
square error is 
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( 564 ) 


e 2 = tr. p (X-aI)(X-aI)t, 

3 . 

where p is the density operator describing the receiver input. Let p(a) be the a priori 

3 

distribution of a, and let 

P = / P(a) P da (565) 

d 

‘ p x = / ap(a) p a da. (566) 

A straightforward application of Theorem 17 defines the optimal X Q that minimizes (564) 

by 

pX o =P 1 . (567) 

When p is positive definite, a solution X q clearly exists: 

X o =p _1 P 1 (568) 

which can be shown to be unique. Note that (567) can also be derived by the gra- 
dient operator method discussed in Appendix I. 

The. drawback of condition (567) is that the optimal X q so found may not be mea- 
surable. Again it is difficult to include measurability constraints in a simple way. 

Estimation of two real parameters can be equivalently formulated as a problem 
of estimating one complex variable. In such a case our above formulation carries 
over directly and the optimal observable is again given by (567), which should now 
be non-Hermitian. Measurability questions come up as in the single parameter case 
above. 

One way to insure measurability in this two-variables case is to allow for two 

self-adjoint observables X^ and X^ whose eigenvalues correspond to the parameters to 

be estimated. We then impose in the optimization problem the constraint that Xj and 

X 2 commute. While optimizing conditions can readily be developed, they cannot be solved 

in general. Similar comments apply to the case of multiple parameter estimation. 

Bounds of the Cram£r-Rao type on the mean-square error can be derived in this gen- 
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eral case similar to the situation where only self-adjoint observables are allowed. 

They will not be further discussed here. 

3. 2 Estimation of Nonrandom Parameters 

42 

A Cramfer-Rao bound was first given by Helstrom for estimation of a nonran- 
dom real parameter, again restricting the observables to be self-adjoint. The 
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following bound results when we relax the Hermiticity condition. . We first define 

~ 2 

the mean square error e in estimation of a possibly complex nonrandom parameter a 
to be 

e 2 = tr. P a (I-a)(X-a)i' _ (569) 


and also define the operator L whose adjoint obeys 
9P 


9a 


= lV 


It is then straightforward to show the following theorem. 
Theorem 18 

The mean-square error (569) is bounded from 
~Z ^ 1 


tr. p LL 
a 


t’ 


(570) 


(571' 


where the equality holds if and only if 

L = k(a)(X-a) (572) 

for some function k(a) of a. 

^ The difficulty with condition (572) is similar to that associated with (567), namely 
the optimal observables so found may not be measurable. Note that our formulation here 
includes estimation of two real nonrandom variables. It is difficult to generalize the 
bound to the multiple parameter case when the corresponding observables do not com- 
mute. 

3. 3 Channel Capacity 

With p given for a set of digital messages, we can write for measurement of X the 
average mutual information 

< x | Pj | x > 

1= Z Z P(j)<x|p |x)log ; — , (573) 

j x J Z p(j) <x [ Pi [x> 

j J 


where the summation notation for x can be interpreted as an integral when x is a 
continuous variable. The a priori probability for message j is denoted by p(j). We 
now define the channel capacity to be 
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C = max_ I. 

P(j);x 


The maximum is taken over all input probability assignments and all possible measure- 
ments. With this capacity for discrete memoryless channels it is straightforward to 
show that both the coding theorem and its converse hold. 


Theorem 19 


With data rate R smaller than the capacity C of a quantum channel defined above, 

the probability of error for digital information transmission through the channel can be 

made arbitrarily small by proper encoding and decoding. Conversly, when R exceeds 

C the error probability is lower-bounded from zero. 

An upper bound on the channel capacity for any measurement observable has been 
24 25 

conjectured by Gordon and rigorously proved by Zador. It was known that the bound 

can be achieved when the pj commute among themselves. By examining Zador 1 s proof 

in further detail, we have been able to show that the bound can also be achieved when 

the p^ pairwise commute as in Eq. 555. In this case the observable that maximizes C 

is the same as the one that minimizes detection error. We would conjecture that this 

coincidence may turn out to be still valid in more general cases, although there is no 

more than a weak-bound argument in support of this, at present. 

In addition to the obvious convexity properties as in the classical case, the average 

mutual information I can be shown to be a convex U function of the p . Consider p to 

1 ~ 

be a vector with components p^ which are density operators. Then a function F(p) is a 
convex U function of p if 


F^p/J^pFlp 1 ) 

N i ' i ~ 

when {p 1 }is a set of density operator vectors, and {p 1 } is a probability vector. This 
convexity property of I as a function of p is the direct analog of the convexity of ordi- 
nary I as a function of the channel conditional probability. They are not going to be 
discussed further here. We note that there is an interesting open problem which con- 
sists in determining the optimal measurements as a function of source rate for the sys- 
tem reliability function. 1 One may then obtain a general quantum system reliability 

function. This problem appears to be extremely difficult. 


3. 4 Other Problems 

There are clearly many other classical communication theoretical problems 
that need quantum analogs. In the estimation area an outstanding problem is 
the development of a proper maximum -likelihood quantum estimate (MLQ) for ‘ 
both random and nonrandom parameters. We suggest that the observable X which 
satisfies 


133 


9p 

8a 


( 574 ) 


_= 0 
a=X 

be called the MLQ because of some interesting properties that it possesses, but we shall 
not discuss it here. Its treatment may be found elsewhere. 

Similarly, it is very interesting to develop the quantum counterpart of ordinary 
Wiener -Kalman filters. While we have not been able to produce any useful results thus 
far, the existence of such quantum filters appears to be promising. Certainly, there 
is plenty of room for these and other areas in quantum communication theory. 

3. 5 Summary of Part II 

In Part II, we have considered some problems relating to optimal performance of 
communication systems under different criteria. Our attention was mainly directed to 
the M-ary detection problem, the major results of which were some necessary and suf- 
ficient conditions of the general optimal detector. While interesting in themselves, 
further consideration is required in applying our theorems in Section II-B to actual 
evaluations of system performance. 

A major difficulty in our optimization problems is that it is hard to express con- 
veniently the constraint of measurability on the observables that we optimize over. It 
seems that an accurate determination of the class of measurable observables will be an 
important basis for system optimization. We hope that this can be achieved by extending 
the analysis of Appendix E. 

It should be clear that there are many interesting open problems of performance 
optimization in a general quantum communication theory. The importance of such a 
theory will be uncertain until it is properly developed. 
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D. GENERAL CONCLUSION 


In Part I of this report we have developed a general characterization of quantum 
communication systems, including the channel and the transmitter-receiver configura- 
tions. In particular, a procedure is described which under certain conditions yields 
the canonical quantum equivalent of a given classical space -time varying linear random 
channel. We have thus provided a comprehensive framework with which density-operator 
receiver input representation can be readily obtained for various communication sys- 
tems. 

In Part II we have established some results concerning the optimization of system 
performance under various criteria. In particular, the general conditions that we pro- 
vide on the optimal digital detector can be taken as a basis for the development and 
evaluation of optimal quantum receivers. In conjunction with Part I we have therefore 
provided some broad principles that are necessary for general quantum communication 
analysis. 

The framework presented in this report is not all encompassing, however. It 
is therefore appropriate to indicate promising areas for future research. These 
include extensions and generalizations of our present work, as well as other topics 
which we have not discussed. 

In the area of communication modeling the most outstanding unsolved problem 
is the general development of a proper field commutator at the channel output from the 
given classical information only. This should be possible, as mentioned in Part I, 
either by employing a more detailed mathematical analysis or making more explicit 
physical assumptions. Many other generalizations are possible, but they appear to be 
minor in comparison with this problem. When the general field commutator is found, 
the quantum issues in communication system modeling will have been completely cleared 
up. This does not mean that the classical quantum transition will be direct and trivial 
for any classical channel because the classical information has to be given in proper 
form for application of our correspondence. 

Many more problems remain to be solved in the broad field of system optimization. 

In fact, there are almost as many different areas in quantum communication theory as 
there are in classical communication theory. Only a few of them have been discussed 
in this report. Quantum receiver implementation also raises problems that have no 
classical analogs. All of these problems are challenging and deserve attention. Par- 
ticular areas that we have not touched upon, but can readily be treated by our methods 
or their simple extensions, include signal design, linear filtering, and other fields of 
analog communication theory. 

Much work also remains to be done in the development of appropriate procedures 
for evaluation of realistic system performance from the general principles., In particu- 
lar, we need practical methods of solving systems of operator equations. Knowledge 
of operator inequalities will also be very helpful. 
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Finally, let us observe that while our theories will be most useful only when optical 
communication systems are sufficiently well-developed, they can actually indicate fruit- 
ful areas of device research for applications to such systems. 
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APPENDIX A 


Mathematical Framework of Quantum Theory 

We shall give a very brief treatment of the mathematical structure of quantum theory 
that is most frequently employed. We shall also define a few special notions that will 
be used in the main content. A common mathematical description of quantum formalism 
has been provided by von Neumann, 5 Dirac, and Rosen. 112 Alternative and general- 
ized schemes have been given by Jauch,^ and others. ^ ^ Introductory discussions 

74 111 142 

with physical details can be found in Louisell, Dirac, , and in textbooks. 

A. 1 QUANTUM STATES 

A physical system is characterized by a quantum-state space which is the set of 
possible states in which the system is allowed to be. This set is generally taken to be 
a separable Hilbert space over the complex field $ , with vectors denoted by the Dirac 
kets 

h> (A. 1) 

and inner product between two kets | \) and | denoted by 

UU>. (A. 2) 

[For a summary of certain mathematical definitions and their elementary consequences, 
which are particularly required in Part II, see Appendix H.] Expression (A. 2) is equiv- 
alent to the usual Hilbert space notation 

(X. +). (A. 3) 

By introducing the concept of a bra vector 

<X|. (A. 4) 

notation (A. 2) has been found more versatile and convenient than (A. 3). The vectors 

a| *|>), a €= $ (A. 5) 

represent physical equivalent states so that one usually considers a state to be normal-, 
ized 

<i)j| ip) = 1. - (A. 6) 

Separability of the space is equivalent to the condition that there exist in the space 
countable, complete, orthonormal sequences of vectors. In concrete applications we 
usually need to choose a particular set of basis vectors, referred to as a representation. 

A. 2 QUANTUM OBSERVABLES 

Dynamical quantities of the physical system are represented by linear operators 
on 3C. Specifically, if A denotes a linear operator, then A| 4>) denotes the transformed 
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vector which is also in 3C. Linear operators are frequently abbreviated here as oper- 
ators. Adjoint of an operator A is denoted by A + . The set of linear operators on 3C 
forms an algebra. ** ^ The identity operator I leaves all vectors and operators 
unchanged, and so is the unit element in the operator algebra. It is convenient to refer 
to physical variables as "q" or "c" numbers, according to whether they are operators 
or just ordinary functions. Frequently a multiple of the identity operator is also called 
a c number. 

A dynamical variable is usually called an observable in quantum theory when its 

corresponding operator A is self-adjoint and possesses a complete set of eigenstates 

in 3C. We shall define observables, however, to include all operators having a complete 

set of eigenvectors. This complete set of eigenvectors may be complete orthonormal 
122 

or overcomplete, where overcompleteness for a set of states means that a proper 

5 8 87 

subset of states is already complete. The spectrum ’ ’ of A can also be discrete or 
continuous when it is self-adjoint, and be arbitrary when it is non-Hermitian. 

Projection operators occupying a central position in quantum theory are denoted by 

U><£| (A. 7) 

for projections into one-dimensional subspaces. At the heart of a full exploitation of 
the Dirac notation is the repeated use of the relation 

1= 2 U> <£| (A. 8) 


or 


i= / di |i) <je|, 


(A. 9) 


which is called a resolution of the identity. Equation A. 8 or A. 9 is valid for an arbitrary 
set of complete orthonormal basis vectors {|£)} in the discrete and continuous spectrum 
cases, respectively. Dirac delta functions are frequently employed to normalize the 
strictly non-normalizable eigenvectors of a self-adjoint operator A having a continuous 
spectrum. Such a procedure leads to correct results efficiently when used with proper 

85 

caution, similar to other use of distributions, or singular functions. In our applications 
such normalization is not needed. 

Our attention is directed primarily to the photon creation and annihilation oper- 
ators b + , b which obey the Bose commutation rules 

[b, b + ] = bb + - b + b = 1. (A. 10) 

119-121 

The annihilation operators b possess an overcomplete set of eigenstates in 3C, 


b|p> = p|p) 

jf IpXpI d 2 p = I 

<P| P’> = exp jpV-!|p| 2 --||p’| 2 j. 


(A. 11) 
(A. 12) 

(A. 13) 
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Each eigenstate j(3) is properly normalized with complex eigenvalue (3. Representations 

of states or observables based on the continuous set |(3) are called "coherent- state 

representations," since |p) are commonly referred to as coherent states with proper 
119 123 

justification. ’ While we shall certainly not only consider the observables b and 
b + but also arbitrary functions of them, we shall employ the coherent -state representa- 
tion when we need to use one. 

A. 3 QUANTUM MEASUREMENT INTERPRETATION 

‘Physical interpretation of the states in 3C goes as follows. When the system is in 
l^ 1 ) quantum measurements^ ® on the observable A would yield possible results a, a 
point in the spectrum of A, with a probability density 

|<«k>| 2 . (A. 14) 

We have implicitly assumed that the eigenvalues a in (A. 14) are nondegenerate. In gen- 
eral proper modification can be made by summing over the degeneracies. The mean 
observable value of A is always 

<A> = < + | A |+>. (A. 15) 

This interpretation is commonly held as postulates of quantum theory when the 
observable A is self-adjoint. When A is not, the validity of the interpretation is uncer- 
tain. At least in the case of boson operator b (discussed previously) these interpreta- 
3 9 

tions can be shown to hold (see also Appendix E) 1 . It is partly for this reason that 
we have broadened the traditional meaning of an observable (see Appendix E). 

Higher observed moments of A are given by 

<A n > = (i|<| A n |ip). (A. 16) 

Similarly to (A. 15), this is consistent with (A. 14) through application of the spectral 
representation of A . If A is unbounded, then (A. 16) may be undefined for certain states. 
The characteristic function for the distribution of a self-adjoint A in the state |i|j) given 
by 

<t> A ^) = <+1 e ltiA |4>> (A. 17) 

is always defined for any |^). (The characteristic functions in non-Hermitian A cases 
are discussed in Section C. 3. lc.) 

From our interpretation it can be seen that the measured values of an observ- 
able A will have a nonzero variance when 1 4 1 ) is not an eigenvector of A. Since 
not all observables can be simultaneously diagonalized for any state 1 4* ) because 
of noncommutativity, there are always some observables with a spread in their 
distribution. This is the essence of the uncertainty principle that dominates phys- 
ical reasoning in quantum theory. 
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A. 4 MIXED STATES AND DENSITY OPERATOR REPRESENTATION 


The states that we have discussed are usually called "pure states" to distinguish 
them from the mixed state described by a self-adjoint, positive semidefinite operator 
of unit trace. A pure state can be represented by these so-called density operators in 
the form 

P = |+> < + | (A- 18) 

so that (A. 14) becomes 

<a| P I a) (A. 19) 

and (A. 16) becomes 

tr (pA n ). (A. 20) 

A mixed state is a convex combination of pure states. That is, 

p = s pJV (A - 21) 

n= 1 

where 

p > 0 (A. 22) 

n 


oo 

2 

n=l 



(A. 23) 


Such a density operator describes an ensemble of pure states. Equations A. 19 and A. 20 

clearly retain their validity with the same interpretation. Thus, in general, a complete 

characterization of a quantum system is given by a density operator. Further general 
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properties and various applications of density operators can be found Fano, 

74 144 5 

Louisell, ter Haar, and von Neumann. 


A. 5 DYNAMICAL STRUCTURE 

For a conservative system described by a Hamiltonian H, the dynamical equation 
governing the system behavior is given by the Schrodinger equation 

dp 

ifi-^=[H, P ] (A. 24) 

so that the density operator p(t) is time -dependent. This scheme is called the 
Schrodinger picture, abbreviated as S- picture. 

In contrast, another description, called the Heisenberg or H-picture, is obtained 
if we retain the states or mixed states fixed but instead change the observables in such 
a way that all expectation values are identical with those calculated in the S-picture. 
Thus we introduce new time -dependent operators representing observables, determined 
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in such a way that 


tr p(t) A = tr p(o) A(t). 

Since a general solution of (A. 24) can be written 

... -iHt/fi , . iHt/fi 
p(t) = e ' p (o) e ' . 

We obtain 

A(t) = e IHt/ * A 


(A. 25) 


. (A. 26) 


(A. 2.7) 


which is the time dependence of an observable in the H-picture. 

Other pictures can also be formulated in a similar manner, but we ‘shall not discuss 
them. The interaction, or Dirac, picture was found particularly useful for many prob- 
lems. 

In our treatment we usually assume in a dynamical problem that we are using the 
S-picture when we talk about density operators and the H-picture when we talk about 
observables. 

When the Hamiltonian H in (A. 24) becomes time-dependent, the evolution of p(t) is 
still given by (A. 24). For other nonconservative systems the precise form of the equa- 
tions of motion is not known in general. They can be obtained in various ways depending 
on individual problems. 
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APPENDIX B 


Treatment of the Vector Markov Case 


We shall discuss the vector Markov case as a generalization of the strict 
Markov case treated in the main content of this report. This vector Markov 
case may be unimportant for the following reason. When the system is linear 
and is described by a total Hamiltonian, the equation of motion for the funda- 
mental field variables would usually involve time derivatives up to second order 
only. If a Markov approximation can be made, the equations will then contain 
only first-order time derivatives and therefore become strictly Markov. We treat 
the vector Markov case here for generality and for possible situations where 
higher order derivative loss terms occur. 

We discuss directly the quantum case that is readily specialized to the classical 
situation. Consider the differential equation 

b k (t) = f k (t), (B. 1) 


where the correlations of the noise operator f ^(t ) are given by Eqs. 213-215. 
Suppose that-Sfj is of the form 


jn 


m-1 


+ a. (t) - 

dt n 1 dt n 1 


+ a n (t) 


so that with 


a > = X k + a n (t) 


the differential operator ^ is expressed by 


,n ,n-l 

X, + if, = - 3 — +a (t) -2— r 
k 1 dt n 1 dt n_1 


+ -" + a n-l (t) dt + a n (t) - 


(B. 2) 


We again let h^(t, t) to be the zero-state impulse response of (B. 1). We have 
turned off the excitation at (B. 1) for simplicity so that the mean of b^(t) van- 
ishes. Our following result can be immediately interpreted when e^(t) is present, 
by letting b^(t) be the operator with its mean subtracted. 

Let the vector XjjW be defined by 
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r* th. 

Here, b^t) denotes the r -order derivative of bjjt). For each k the vector X^(t) will 
be a Markov state vector. Define the noise vector 



We can put (B. 1) into the state -variable form 


dX, 
— k 

dt 



(t)x k 


+ f k<*>‘ 


(B. 4) 


(B. 5) 


(B. 6) 
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It should now be clear that manipulations can be carried out for X k (t) identical to those 
for b^(t) in the strict Markov case. We shall therefore only give the final results of 
importance. 

The state transition matrix h^(t, t) obeying 


|IS + ik (t) 


h k (t, t) = 0, 


t > T 


h, (t. t) = I 


is found in terms of h k (t, r) as following 2(n-l) X 2(n-l) matrix 


h(t, t) = 



(B. 7 ) 


The solution vector is 


x k (t) = h k (t.t o )x 1 R (t 0 ) + 



h k ( t, T) f k (T) dr 


(B. 8) 


° r 

X k (t) = J h k (t. T) f k ( T ) dT (B. 9) 

under the assumption that the initial distribution arises from f k ( T )- Fluctuation- 
dissipation relations can be written down similar to Eqs. 234-236. These and other 
relations can be explicitly obtained by expansion when desired. 

One time preservation of the commutator 
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/ 


b k (t), bj(t) 


1 


can be achieved by solving the integral equation (280), but the solution is now not given 
by (280). Its explicit form is not important, however. The two-time commutator then 
follows with^the form 


b k (t) ’ b k (t,) 




dT 


T=t* 


I <-»" 


-r d 


n-r 


r=2 

n-1 


dr 


n-1 k 


h, (t, t) 


< b k 1 ( t ’) b k (t,) > 


T— t ' 


, ,r ,n-l 

- ) (-l) n 1 — 7 h,(t, t) 

4 dt r dr 11 ' 1 k 

r=l 


< b k(t , )bJ(f)). 

T=t 1 


(B. 10) 


The one-time averages occurring in (B. 10) can be computed from hjjt, t) and the dif- 
fusion coefficients through Eq. 235. The field commutator can also be obtained from 
(B. 10), although it will now be quite messy. The equal-time commutator is simple, 
however, and from (B. 10) is 


V r,t), ^P (F,,t) ] = I *k (?) fn=l 


T=t , 


d^ 

dt 


n=i E V r) \ (r "> h k (t ’ T) 

k 


t = T , 


= 6(r-r'). 


(B. 11) 


Equations B. 10 and B. 1 1 are all that are really required in our generalization of the 
Markov case to the vector Markov case. 
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APPENDIX C 


Fluctuation-Dissipation-Amplification Theorems 

We shall give a deeper discussion of the fluctuation-dissipation theorem^^’ 68,104-107 
employed in Part I and indicate possible generalizations to our general case. We shall 
also try to interpret the theorems for applications to amplifiers other than atten- 
uators. 

C. 1 FLUCTUATION -DISSIPATION THEOREMS 

We have only employed fluctuation -dissipation theorems for establishing the field 
commutators, while at the same time these theorems actually give all of the two-time 
quantum averages. This puts some constraint on the classical process with mean 
and covariances given separately. It therefore appears that not all given classical pro- 
cess would obey fluctuation-dissipation theorems. Since such theorems seem to be 
quite generally applicable and useful both for field commutator specification and for 
other purposes, we can first observe their nature more closely and then see what 
kind of conditions are required for their applicability. 

The fluctuation -dissipation theorems in the Markov or vector Markov cases are 
direct mathematical consequences of the Markov character of the processes. No phys- 
ical assumption is required for their validity. On the contrary, the stationary system 
fluctuation-dissipation theorems are derived from the so-called linear response the- 
ory, 10 ^*’ 107 which applies to the system plus its environment so that the total sys- 
tem is describable by a Hamiltonian. The form that we use in this report can readily 

6 ) 8 

be obtained from M. Lax. The point to be observed here is that once the mean equa- 
tion of a system observable is known, the two-time fluctuations are also determined, 
regardless of the details of the reservoir. While elegant in its interpretation and 
rich in its applications, this theorem is unfortunately restricted to stationary pro- 
cesses. 

The nature of the derivation of these stationary fluctuation -dissipation theorems sug- 
gests that it is fruitful to consider a stochastic system as part of a conservative system. 
Some physical assumptions may be involved in such a description. It will be extremely 
useful if we can then derive some system statistics from the system mean equation 
independent of the detailed reservoir behavior. Preliminary consideration for gen- 
eralizing these theorems to the time -variant case results in certain analytical 
difficulties. It appears nevertheless that such generalizations are quite viable. 
Another possible route for such generalizations lies in exploiting the mathematical 
structure of particular classes of random processes. We feel that a development 
from the physical point of view is likely to be more generally applicable. The phys- 
ical assumption would illuminate rather than restrict their application to individual 
problems. 
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C. 2 FLUCTUATION -AMPLIFICATION THEOREMS 

The usual stationary fluctuation -dissipation theorems have not been interpreted or 
modified to apply to situations in which the system energy is amplified rather than atten- 
uated. We introduce the term fluctuation -amplification theorems to indicate relations 
that apply to amplifiers that are similar in spirit and in content to the usual fluctuation- 
dissipation theorems. Such theorems clearly exist in the Markov case, even quantum - 
mechanically. By proper reinterpretation of the usual stationary fluctuation -dissipation 
theorems, they should also be applicable to amplification situations, although in such 
cases the physical nature of the system is more uncertain. 

To indicate how such an interpretation may be possible, let us consider Eq. 258 
whose right-hand side has to be positive. Let the temperature T in n(co) of Eq. 260 be 
negative, andjS?^(u>) also be negative in the frequency range of interest. Such a neg- 
ative imaginary part of the 'susceptibility' can be readily seen to imply amplification. A 
negative temperature also changes the dissipative environment to an amplifying one. 
We can therefore retain Eq. 257 as consistent when applied to amplifiers. Generali- 
zation similar to the fluctuation -dissipation case discussed above should also be pos- 
sible. 

In this connection let us note that the positivity of Eq. 258 puts a fundamental limit 
on the noise behavior of our system considered as an amplifier. Let us write 

1 

n(«; = : :>o (c. l) 

-Wk B |T | 

1 - e 

for a negative temperature T^. We have 


< Fj(u)F k («i)> = 2R ti(co) |j£? k (u) | (C. 2) 

< F k (“)Fj(u)> = 2fi{n(w)-l}|jSf k («)| (C. 3) 

so that 

r\(a)*l. (C. 4) 

The minimum noise results when 

h(w) =1 (C. 5) 


whose physical origin is spontaneous emission. 

This limit on the minimum noise present in our system appears to be general, at 
least for systems of the kind considered in the derivation of Eqs. 256 and 257. It should 
be worthwhile to examine further the generality of (C. 2)-(C. 3) when applied to amplifiers 
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because we may then access the fundamental noise limit of linear stationary amplifiers 
in complete generality. Note that the case of a time-variant linear amplifier may have 
quite different limits. 

General considerations of this kind for amplifiers are also important for our pur- 
poses, as we recall from Parts I-E and I-F that they influence the specific form of our 
receiver input density operator representations. In particular, when we implement the 
integral of Eq. 36 1 by a matched filter as in Eq. 365, the filter introduces an additive 
noise obeying (C. 2)-(C. 4). 

Let us note that the fluctuation-dissipation-amplification theorems are not nec- 
essarily required for specifying field commutators. In fact, we have indicated in sec- 
tion 4. 2 (Part I) how the commutator can be determined from the system representation 
In spite of this, we feel that the development of fluctuation -dissipation theorems is of 
general importance because we can obtain further nontrivial information on the sys- 
tem without additional essential assumptions. 
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APPENDIX D 


Generalized Classical Quantum Correspondence 

We shall discuss briefly several generalizations of the theory presented in Part I-D 
and I-E. This involves the relaxation of many of the assumptions that we have made in 
our development. The possibilities of these generalizations should be quite apparent 
from our discussion. 

D. 1 NON -GAUSSIAN QUANTUM NOISE 

Our Gaussian noise assumption was made primarily to simplify analysis analogous 
to the classical case. While such an assumption is often justified, it can be relaxed if 
we impose other structures on the quantum processes. One such structure is a quantum 
Markov process defined in section 3. 2. 4. There are many non-Gaussian Markov pro- 
cesses even within the Fokker-Planck-Kolmogorov regime. If we allow a 'generalized' 
Fokker -Planck description, further non-Gaussian processes can be taken into account. 

In a classical quantum correspondence we can set all given classical diffusion coef- 
ficients to be the normal ordered quantum diffusion coefficients. A two-time commu- 
tator in such a non-Gaussian case is still given by our Markov results, as they depend 
in no way on Gaussian assumptions. It should be clear that all of our theory can be 
straightforwardly carried through in principle in this Markov case, although added dif- 
ficulties may arise in density operator calculations. 

D. 2 INCLUSION OF SPATIAL DISSIPATION 

Our assumption (Eq. 11) has been used only to the extent of simplifying analysis in 
a number of places. It is by no means essential. With the assumption about the noise 
source correlations that we have made in Eqs. 17 and 18 this assumption is not really 
required in most of our treatment. It will be required, however, in order to retain the 
simplicity of our development when the classical noise sources are spatially white. At 
the expense of considering coupled equations, we can always employ the noise normal 
modes. We shall see in Appendix F that there frequently exists a mode expansion 
with independent amplitudes, even when.Sf 2 is dissipative, so that Eqs. 17 and 18 
are in fact not unusual. 

D. 3 COUPLED SPACE AND TIME DERIVATIVES 

When the operator .Sf of Eq. 1 does not factorize as in Eq. 5 our treatment cannot 
be held to be valid. When a mode expansion is not needed it may be possible to regard 
the field commutator as given by the unperturbed Green's function, in the approximation 
that we discuss in Part I-F. Unfortunately, even the unperturbed Green's function would 
be very complicated in such a case. 
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APPENDIX E 


Theory of Quantum Measurements 

We shall give a brief development of quantum measurement theory which is of prime 
importance both for receiver implementation and for specifying a meaningful range 
of measurement optimization. Let us first indicate the nature of our following anal- 
ysis. 

E. 1 INTERACTION HAMILTONIAN ANALYSIS OF 
QUANTUM MEASUREMENTS 

When we make a microscopic measurement on a system, we invariably let it interact 

with a measuring apparatus which in turn produces a macroscopic trace as a result 

5-9 

of the interaction. We shall not reproduce the many discussions about the philos- 

ophy and physical nature of quantum measurements. It suffices for our purpose to 
note that the system-apparatus interaction, which is essential for quantum measure- 
ments, has to be treated in a quantum theoretical fashion as was emphasized by Bohr. 
(Actually for a classical measurement too, but one can assume that the disturbance of 
the system because of this interaction can be made arbitrarily small in the classical 
case.) Since it would be extremely complicated, even in the classical case, to treat the 
actual functioning of the measuring apparatus, we take a simple view that an appropriate 
set of apparatus observables will have a macroscopic manifestation or can be measured 
in some other way after the system-apparatus coupling. This is in accordance with 
the Copenhagen interpretation of quantum theory. Our measurement problem consists 
in elucidating this system-apparatus interaction in the measurement. The philosophy 
or nature of this interaction Hamiltonian approach for receiver implementation is actu- 
ally a more delicate problem which we shall not discuss further. 

The description of measurements by an interaction Hamiltonian was first introduced 
5 

by von Neumann, with a different purpose from ours. It has also been considered more 

37 38 39 

recently by Gordon and Louisell ’ with some generality. Arthurs and Kelly have 

given a particularly interesting example of such treatment. The following considera- 
tion is an extension and generalization of these works. 

We give a more quantitative description now. For simplicity, we restrict ourselves 
to the case in which the apparatus can be considered to be initially in a pure state, which 
we write 



We describe the system at the beginning of our measurement by a density operator 

s 

P 

where the superscripts have their obvious meanings. The measurement is carried 
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out by letting the system plus apparatus interact with an interaction Hamiltonian 


HpSp.s. i = 1 N 

i 


(E. 1) 


where the p^ are a set of commuting apparatus observables, and the s^ are a set of 
system observables. We are dealing in general with N apparatus degrees of free- 
dom. Note that this form of H T is very general. The usual impulsive -interaction 
6 

approximation can be made which says that H^. dominates the evolution of the sys- 
tem plus apparatus for a short time after they are coupled. At a certain short time 
afterwards one then observes an appropriate set of apparatus observables, the values 
of which are indicative of the measured values of some corresponding system observ- 
ables. If {q^} is the set of apparatus observables being looked at, then the probability 
of obtaining a set of values {q.} is given in general by 


P({q i }) = 


Tr 




(E. 2) 


where 


KlqJ) = <{q i }|u(t)|^ A > 

-itH /fi 

U(t) = e = exp 


Ifai}) = K> ••• K> ••• |q N > 



(E. 3) 


and the system is left in the mixture 


Pf(t) 


K {qj}) p 5 ! 1 ^}) 
P({qi}) 


(E. 4) 


by applying the projection postulate to the apparatus. (The projection postulate, first 

5 

formulated explicitly by von Neumann, states that if an observable X is measured on 
a system with a result x, then the system is left immediately after the measure- 
ment in the eigenstate |x) corresponding to x. We speak about a nondegenerate 
spectrum throughout for simplicity. The case of a degenerate spectrum can be 

easily included.) The probability in (E.2) is also the probability of finding the sys- 

s 

tern in mixture given by (E. 4). Note that p^.(t) would depend on an initial system 
state unless I({q^}), which is a system operator, factorizes into the form of a gen- 
eralized projection operator (dyad) 

i=h s ><0’ (E - 5) 


151 



in which case 


P({5i}) = <+“'! p s !+ t m > 


(E. 6) 


and 


Pf(t) = l^t) <+t I’ 


(E.7) 


Here the {q^} is parametrically related to the eigenvalues of |ij^) which is the eigen- 
state of a certain system operator. These results can be derived rigorously. These 
derivations are omitted here for brevity. We can also allow I to depend on |ip ) or 
not, by proper adjustment of |i|A), The case wherein I factorizes as it does above 

and does not depend on jifA) was called an ideal measurement by Gordon and 
37 3 8 A 

Louisell. ’ We can relax the condition of | ^ ) independence, however, which to us 
is no less "ideal" than the other case. 

If we consider ideal measurements (in either of the two senses mentioned), then we 
can say that the measurement scheme described above corresponds to the measurement 
of the system observable X. 

X|x) = x|x), 


with 


! x > = 1 ‘ 0 - 

in all of the known situations, it turns out that 

K>= IO- 


Note, however, that X does not have to be self-adjoint. The only requirement is 
that it have a complete set of right eigenstates. One would then tend to ask how we can 
measure such an X in an ideal measurement. This consists in finding an H T , a set of 
|{q.j}), and an ] i|» ), in our case, so that I 1 ! 1 ™) is the eigenstate of X. To this question 
we now turn our attention. 

E. 2 MEASUREMENT OF OBSERVABLES 

We now show explicitly how a quantum measurement can be accomplished in the 
framework described above. Let us consider the measurement of a self-adjoint sys- 
tem observable S. For this purpose, we choose 

Hj = P S (E. 8) 

for an apparatus observable p whose conjugate variable is q. From the Heisenberg 
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equation of motion, we have 


dq 
dt = 


S 


dp 

dt 


= 0 . 


so that q is the observable that we should measure. Let us, for simplicity, choose 



(E. 9) 


In this situation we arrive at 

I = |S=q-q Q > (S=q-q o |, 

where we have taken the measurement time to be one. The state |S = q-q Q ) is an eigen- 
state of S whose eigenvalue is q - q . An ideal measurement for S is now achieved 
by observing q. 

In a sense this discussion demonstrates that every self-adjoint observable can be 
measured in principle. On the other hand, the argument can be regarded as cir- 
cular, since we have now to observe q. When we can make a macroscopic record 
on the outcome q this interacting Hamiltonian method may be considered as a satis- 
factory way of doing quantum measurements. In any case we have illustrated the 
power of our approach and the kind of measurements that can be made with this 
scheme. 

Similarly, we can show that we shall be able to measure a system photon oper- 
ator b with 

[b.b 1 ] = 1, 
by choosing 

h i = Pi p + p 2 q (E. 10) 

whose p and p are two commuting apparatus observables, and P and Q are related 
+ 1 i 
to b, b ' as usual, 

b= — — (P-iwQ). (E. 11) 

In this case the conjugate variables q^ arid q 2 of p^ and p 2 , respectively, should 
be observed. Furthermore, the. apparatus initial state should be chosen as a prod- 
uct of two coherent states whose parameters are determined by w of (E. 11). It is 
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then straightforward to show that the system operator I of (E. 3) factorizes into 


|e><c|. 

where |p) is the eigenstate of b. A detailed derivation is omitted here. 

Assume that a system observable X possesses a complete set of eigenstates. In 
general we can write 

X = Xj + iX 2 , 

where both Xj and X 2 are Hermitian. The commutator 

[Ei . X 2 ] = 


will involve a Hermitian operator X3 which may not be a c-number. In this case we 
have not yet been able to measure such an X with our approach. It appears, however, 
that such an X should indeed be measurable in our sense. Further effort is required 
to fix up this important point. 
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APPENDIX F 


Relations of Linear Fields 


We shall derive certain relations between the fundamental field variables, in par- 
ticular those between the electric field <f Qp (r, t) and our 4 J 0 p( r .t)- Our relation here is 
general, and more explicit relations have to be obtained depending on individual cases. 
We first observe that given a commutator 



(F. 1) 


there will frequently exist, even when -Sf of Eq. 1 is not of the form of Eq. 5, 
modes 4> k ( r ) such that 

C € (?r';tt') = Z 4> k (?) 4> k (?') C k (t,t'), 
k 


spatial 


(F. 2) 


where 


C k (t,t) = ifi. (F. 3) 

Karhunen-Lo6ve expansion of the form (F. 2) holds for a given field C^rr'jtt') under 
rather general conditions. In such a case we can expand 


^op (? ’ t) = ^ 4, k (?)q k (t) 


(F. 4) 


i op (r,t) = Z 4> k (r) p k (t) (F.5) 

k 

[ q k (t) ’ q k (t) ] = (F.6) 


We neglect a possible multiplicative constant to co(r, t) which, depending on both the 
medium and the units, makes $ q ^y, t) the ordinary electric field. 

Let us introduce another set of mode functions $ k (r) 

V X $ k (F) = <j> k (F) (F. 7 ) 


so that the magnetic field, also up to a multiplicative constant, is 
JT (F,t) = Z (r) p, (t). 

Op k K K 

The commutator between & (r, t) and (r,t) is therefore 

i op op 

k op (F,t), jr 0 p(?'.t-)] = Z«, k (r) * k (f)[q k (t),p k (f)]. 

k 


(F. 8). 


(F.9) 
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We further define the photon operators b^(t) for k: 

b k <t) = — ■ 1 - (p k (t)-io>q k {t)} 

V 2Roj 


(F. 10) 


so that 


[b k (t), bj(t) = 


(F. 11) 


and consider the field variables 


V r,t) = k <t>k(r) b k (tK 


(F. 12) 


It is clear from (F. 12) and (F. 10) that 4* (r,t) is related linearly to S (r,t) and 

• op op 

<^ 0 p(r,t). An explicit relation appears to be difficult to find in general even if there 
is one, but can be found when either the time behavior of q^(t) or the spatial behavior 
of ^(F) is known, together with the dispersion relation co, . With generalized functions 
allowed, <^ 0 p( r >t) is related linearly to so that in general we can write 


+ (r,t)= / h(rt; r't') «? op (r',t') 


(F. 13) 


for a deterministic filter h(rt;r't'). Frequently either a spatial or a temporal filter 
is already sufficient for relating i|j (F, t) and S (r,t). 

r 1 

Suppose that a possible random Green's function G R (rt;r't') is given relating the 
input electric source field to the output electric field of a transmission medium 


* (r,t) = f G„(rt; r't') ® (r\ t') dr'dt' 

op rv op 


(F. 14) 


s — 

when both the signal and the noise sources are included in S (r,t). The random Green's 

np 


function G„(rt; r 't ') for iJj (r, t) is then 
R x op 

4 J nn (F , t) =/ G R (Ft; r't') (?', t') dF'dt', 


(F. 15) 


where 


oL(rt;r't') = / h(rt;r"t") G 0 (r"t"; r'"t"') h 1 (r'"t'",t'r') dr "dr '"dt"dt"' (F. 16) 

rt rv 


f h(rt;r"t") h *(r "t"; r 't ') dr "dt" = 6(r-r') 6(t-t ’). 

The filter h ^(rtjr't') is the inverse of the filter h(rt;r't'). 

We next assume that di (F,t) has the commutator 

op 

[V F ’ t) 'V F, ’ t,)] = S (Ft;F ' t ' ) 


(F. 17) 


(F. 18) 
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The commutator C^rtjr't') of Eq. P. 1 is then given by 

2 4> k (?) 4> k (?')[q k (t),p k (t')] = Z 4> k (?) 4> k (r') f { b k (t),bj(f)] + [b k (f),bj(t) j 

(F. 19) 
or 

C e (rt;r't') = y {c^Ft; Ft'J+C^Ft', t)}. (F. 20) 

Finally, let us observe that our commutator specification has favored observables 
of the kind ^ (r, t) and ^^(F, t) discussed above where the operator character is put 

on the time amplitudes. Equivalently, we may construct spatially dependent operators 
to advantage in certain cases. The corresponding commutators can be formed and 
related similarly. 


157 


APPENDIX G 


Direct Calculation of Density Operators from Fields 

We shall give an explicit proof that the procedure leading to the construction of den- 
sity operators for different receiver configurations as described in section 5. 5. 1 
(Part I) is indeed correct. Our discussion should also demonstrate how a sum of inde- 
pendent quantum observables may be described by a single density operator in the man- 
ner of section 3. 1.6 (Part I). We shall proceed rather rapidly, but the details can be 
filled in without difficulty. 

We wish to show that density -operator representations can be calculated directly 
from a statistical specification of 4' 0 p(r,t). The basic point to observe is that for a 
field 4 J Q p(r, t) of the kind in Eq. 193 there are infinitely many Schrodinger photon opera- 
tors b with 
n 


b , b^, 
n n 1 


fb , b ,1 = 0 
L n n ,J 


(G. 2) 


so that ill (r, t) is a linear combination of the b . Thus the linear functional 
op n 

a k = / 4' op (?. t) W k (r,t) drdt 


(G.3) 


is also a linear combination of the b . We write 

n 


a. = Z L b . 
k kn n 

n 


(G. 4) 


Suppose that the set a^ obeys 


_ a k’ 4' 


(G. 5) 


K- a k-] = °- 


(G. 6) 


The transformation matrix L which is defined by 


a = Lb 


(G. 7) 




(G.8) 


is thus unitary from (G. 5)-(G. 6). 

Each of the b n is described by a P -distribution, P n (P n 'Pn)’ so that the density 
operator describing the field ^Qp^ t) can be generally given by 
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(G.9) 


P = n ; P 1 > n -Ol |i n>< |3 nl 


Introducting the antinormal ordering operator 

6 <w ‘H-O * Mp„i 

and making a transformation to a k and the corresponding a^, we have 

- = ‘ik» k . i ‘iK ) " “ 2 « k 6 <» k -« k > 6 (4- k )- 

th *” 1 

Here is the nk tr 1 element of L , the inverse of L. We can now write 
nk ~ 


= j4 


II P (z V , o, , Z a*V 

n n U nk k k nk k ) 


(G. 10) 


(G. 11) 


(G. 12) 


On the other hand, the P -distribution of (a, a) as calculated by the procedure of sec- 
tion 5. 5. 1 (Part 1) will turn out to be 


P( 


a, a*) = II P ( 
“ - n n \ 


2 f.a,, X { . 

k nk k k nk 


* * \ 
nk G k J 


(G. 13) 


so that 


p = 3/ P(a, a 


(G. 14) 


is the same as (G. 12). It can be seen that all that we have done is to make a change of 
variables in p. When the transformation is unitary the a variables possess properties 
exactly identical to the p variables. As the properties of a can be obtained directly 
from the statistics of t) we can give p(a, a^) without the knowledge of L and 

p(b, b^). 

” F 

When the vector a is finite dimensional it may be possible to extend it to an infinite 

dimensional vector a which is unitarily related to b. In such a case the P -distribution 

F 

of the finite dimensional a is clearly 


P(£ F -£ F *) = / 11 d 2 c. P(o, q*) 

i^F 1 

and the corresponding density operator will be 


(G. 15) 


F F F* . 

P = j*P (£ ,« ) = tr -{ ai) i^F} p 


(G. 16) 


F F 

and so is the correct reduced density operator for a . In particular, when a = a is 
one -dimensional the density operator for "a" can be constructed by our procedure. 
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In the situation 


■l a* 

V a k' 


* 0 


k * k', 


(G. 17) 


or in addition 


[ a k> a k'l * 0 k^k'. 


(G. 18) 




our procedure of calculating p(a, a') may not be valid, since in (G. 13) there will be an 
additional determinantal factor that cannot be calculated in general. We cannot there- 
fore readily obtain results independent of specific L. It seems that more specific sets 
of eigenstates of a need be constructed in this case from those of b. We may then be 
able to determine the form of p(a, a^) more generally. Otherwise more detailed infor- 
mation of + (?. t) will be required; for example, we may need its explicit expansion 

op a. 

in terms of b and the density operator p(b, b'). Further discussion will not be made. 
Note that the canonical representation of Eq. 385 can always be employed. 
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APPENDIX H 


Mathematical Definitions 

We shall now define the major mathematical terms and notation used in the text. 

137 

Some of their more elementary properties will also be mentioned. Luenberger is 

probably the single reference that contains most of our definitions. For the others and 

for more details Akhiezer and Glazman, 138 Schatten, 138 Riesz and Sz-Nagy, 138 and 
87 

Freidman may be consulted. 

If x is a number of the set S, we write x e S. If V is a subset of S, we write V C S. 
If V C S and V # S, then V is a proper subset of S. The set of real numbers will be 
denoted TR, and the complex numbers $. If S is a set of real numbers bounded below, 
then there is a largest y €E TR such that x 5* y for all x 6: S. The number y is called 
the greatest lower bound or infimum of S and is denoted inf. x. The notation V means 
"for all." xeS 

Let L be a linear space over 1R or $. A set n C L is said to be convex if, for 
a given Xj.x^ 6E £2 all elements of the form cXj + (l-ct)x,, with 1 » a > 0 are in £2. A 
set C in a linear space is said to be a cone if x G £2 implies sx£fi for all a > 0. A 
convex cone is a set that is both convex and a cone. Let P be a convex cone in L. For 
x, y e L we write x » y if x, y e P. The cone defining the > relation is called the posi- 
tive cone in X. Let L, M be two linear spaces. Linear transformations are abbreviated 
here as transformations. 

Let L, M be normed linear spaces. An operator A on L to M is bounded if there 
is a constant m such that || Ax|| ^ m|| x|| for all x £ L, where the norm is denoted as 
usual by || ||. An operator is bounded if and only if it is continuous. If M = $, then 

bounded operators from L to M are called bounded linear functionals . If M = TR, it 
is simply called a functional. Let L be a normed linear space, the space of all bounded 
linear functionals on L is called the dual space of X and is denoted L with ele- 

jjc 

ments x . We also use the star notation for complex conjugates of x €E C. No confusion 
is possible, however. Given a normed linear space with a positive cone P C L, one 
defines a natural corresponding convex cone P in L byP ={x |x x^oVxE P}. 

Let X be a linear space and let Z be a linear space having cone P as the positive cone. 
A mapping G from X to Z is convex if the domain £2 of G is a convex set and if 
G[aXj + (l-a)x 2 ] aG(Xj) + (1-a) G(x 2 ) for all Xj,x.,e ST and all a, 0 < a <1 (see partic- 
ularly Luenberger * 3 ^ for these definitions). 

Let L be a normed linear space. It becomes a Banach space if it is complete with 
respect to the metric induced by the norm. A Banach space becomes a Hilbert space 
if an inner product ( , ) can be defined which gives the norm. A normed linear space 
is separable if it contains a countable dense subset. Two vectors Xj,x 2 in a Hilbert 
space 3C are orthogonal if (x^,x 2 ) = 0. Two subsets K^.K^ of 3C are orthogonal if 
Xj and x 2 are orthogonal V XjEKj, x 2 e K 2 - 
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Let X be an operator defined on a dense domain D v C 3C, 3C a Hilbert space. Then X 

has an adjoint operator X* with domain D__. = {g|(Xf, g) = (f, g') for some g' G 3C and every 

, f X 

f G D^j-. X is called Hermitian if X 1 is an extension operator of X and is self-adjoint 

if X = X^. A self-adjoint operator X is positive semidefinite if (f, Xf) > 0 and positive 

definite if {f, Xf) > 0 for all f G Let {| x^)} be a complete orthonormal set of vectors 

in a separable Hilbert space 3€. Then the trace of an Y is defined as 2 (x.,Yx.) and 

i 

and is denoted tr. Y. An operator X on 3C is completely continuous if it maps every 
bounded set into a relatively compact set on 3C. All bounded operators are completely 
continuous on a finite dimensional Hilbert space. An operator X with tr. X^X < oo is 
of the Hilbert- Schmidt class X of the trace class if I tr. x| < oo. A Hilbert-Schmidt 


operator is necessarily completely continuous and a finite-trace operator is necessarily 
Hilbert-Schmidt. Completely continuous self-adjoint operators have spectral resolu- 
tions exactly analogous to finite dimensional Hermitian matrices. A projection oper- 

2 

ator P on is an idempotent {P =P) self-adjoint bounded operator. Two projection 
operators P., P~ are said to be orthogonal if P. P = 0. Two projection operators are 

^ XL* g — 

orthogonal if and only if their ranges are orthogonal (see Freidman, Akhiezer and 
Glazman, 135 and Schatten 13 ^ for these definitions). 
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APPENDIX I 


Optimization Conditions and Proof of Theorem 15 


We shall briefly consider some general optimization methods that we employ in 
establishing our optimal detector specification. The proof of Theorem 15 will also be 
given. 

First, we state the general convex programming duality theorem which is the major 

tool that we use in the proof of Theorem 15. The proof of this duality theorem has been 
• 1 37 

given by Luenberger. Relevant definitions of the terms can be found in Appen- 
dix H. 

Theorem I. 1 

Let f be a convex functional defined on a convex subset ft of a linear space X, and 
let G be a convex mapping of X into a normal space Z. Suppose there exists an 
Xj G X such that G(Xj) < 0 and that 

v p. = inf. {f(x)|G(x) <0, x G ft} 
o 

is finite. Then 

inf. f(x) = max inf. {f(x)+Z*G(x)} 

G(x)<0 z *j 0 x£ft 

x e ft 

* * * 
for x G Z and the maximum in the right is achieved by some Z Q > 0. If the infinum 

at the left is achieved by some x q G ft, then 

z*G( Xo ) • 0 

Z"' 

s{c 

and x minimizes f(x) +Z G(x), xG ft. 
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In the proof of Lemma 3 we need the following projection theorem. 

Theorem I. 2 

Let x be a vector in a Hilbert space and K a closed convex subset of the space. 
Then there is a unique vector k Q G K such that 



for all k G K. 

The following gradient operator method can also be used to obtain the necessary 
conditions for optimality in our situation. The essence of our method lies in the 
observation that a bounded linear operator defined everywhere on a separable Hilbert 
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space can be represented by a discrete infinite matrix. 1 Let 38 denote the Banach 
space over $ of all bounded linear operators on 3C. Then the operator X. G @ is com- 
pletely specified by the infinite matrix 

X = (x..) 

- i] 

X i j = (i|x|j>. 


where {ii) } is any complete orthonormal set. 

Consider a nonlinear real -valued functional f(X, X^) defined on 38 . We can clearly 

regard f(X,X^) as f(x ..,x.. V a real -valued function of infinitely many complex variables 

x.., x... To facilitate treatment, we further write 
ij ij 

x. . = xf. + ix?. (I. 1) 

i] . i] ij 

r I — -+ 

where x^. and x^ are the real and imaginary parts of x^. Thus we can consider f(X, X ) 


fe.4). 


that is, a real -valued function of countably infinitely many real variables. It is then 
clear that in order for f(X,X^) to achieve an extremum we must have 


— t -= 0, \A(i,j), 

**« 


under the assumption that f has continuous first partial derivatives. In these varia- 
tions we have to regard the x„ as completely independent, that is, we vary each x.j 
independently. 

Condition (I. 2) can be put in a much more suggestive and useful form by introducing 
the "gradient operators 11 

8f(X,X t ) 3f(X,xt) 9f(X, Xt) 8f (X, X^ ) 


whose matrix representations are 
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9f\ 

9f 

9X7. 

9x r . 


1] 

9f \ 

- JL. 

9X 7. . 

9xf. 

— iJ 

ij 


so that condition (I. 2) can be written in operator form explicitly independent of repre- 
sentation 

= » =0 . 

9X r 9X 9X 9X ' 


It can be easily seen that just setting 
9f _ „ 


is already equivalent to (I. 2). Note that 9f/9X is an operator in . In actual calcula- 

■>£ 

tion of these first derivatives we can vary x.. and x.. as if they are independent real 

variables because from (I. 1) we consider x. . as a function of both x.. and x.., so 

13 i] i] 

that (I. 2) gives 


Wlii ®v. 

Finite dimensional gradient matrices of the kind (I. 3) have been introduced before 
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for application in matrix differential equations. Conditions of the type 


have also been used for finite dimensional vector optimization. 1 Our gradient oper- 
ator or operator derivative of a functional is closely connected with the Frechet 

1 37 138 

or Gateaux derivative ’ in a normed linear space. The simplicity that we have 
achieved here is that (I. 5) is a direct condition on the elements of SB . 

Care has to be exercised in evaluating the gradient operators 9f/9X. They are 
not entirely similar to ordinary differentiation and, in fact, we do not have all of the 
derivative formulas for various forms of f. In actual cases f has to be written down 
as an explicit function of x.. and the derivative with respect to x.. taken in the usual 
manner. The resulting function is identified with the ij element of an appropriate 
matrix, which can then be expressed as an operator independent of representation. 

In the presence of constraints 


F a (X,xh = 0 , 


a = 1, .... N 


for a set F of arbitrary transformations on SB , we can introduce the Lagrangian 
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(1.7) 


L = f/xYx 1 . ) + 2 2 X G F C (xf-.x 1 .), 

V V' a = 1 m, n mn mnV ^ W 


where F° is the (m, n) element of F G , and X G is a set of scalar Lagrange mul- 
mn mn 

tipliers. A necessary condition for X to be a local extremum is then analogous to 

, 146-148 

the usual case 


9L 

9x r . 


9L 

9X 1 . 


= o. 


( 1 . 8 ) 


Introduce operators X. such that 


(X G ) = X G . 
mn mn 


Then we can write (I. 7) in the operator form 


L = f(X.X^) + 2 tr X G F G . 


(1.9) 


The Lagrange multipliers X have to insure that 
tr. X G F a £ TR V a. 

Our condition (I. 8) can now be compactly written as 
9L 


9X 


= 0. 


Applying (I. 10) to Problem III of Section II-A, we let 


(I. 10) 


L = 2 tr. -rr.p. - tr. X ( 2 it. - 1) - 2 tr. ir.ir.X 1 ^, 


r I 


2 


2 


<ij> 


2 i 


(I. ID 


where (ij) denotes a sum over all (i,j) for which i + j. Taking the derivative (I. 10), 
by a straightforward evaluation, we have 




X - p. = 2 tr. X^tt.. 

J U*j) 


(I. 12) 


Multiplying both sides of (I. 12) by u\, we obtain 


(X-p .)tt. = 0 

V 2 


(I. 13) 


so that 


X = 2 -rr.p. = 2 p.ir.. 

j j J J 


(I. 14) 
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This affords the proof of Theorem 15. 

Implicit in our method are various continuity properties that guarantee existence 
of the relevant quantities. With the simple functionals that we have in Problem III 
no trouble should arise in this connection. Note that Theorem 15 can also be 
proved by general Lagrange multiplier theorems with derivatives inter- 
preted in the Frechet or Gateaux sense. Our approach here has the virtue of being 
more simple and direct. 
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